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ABSTRACT 


These  proceedings  report  the  results  of  a workshop  on  the  applications  of 
distributed  system  theory  to  the  control  of  large  space  structures  (LSS) , which 
was  held  at  the  Jet  Propulsion  Laboratory,  July  14-16,  1982.  Co-sponsored  by 
the  Jet  Propulsion  Laboratory  and  Langley  Research  Center,  this  workshop  re- 
sponded to  a rapidly  growing  interest  within  NASA  in  developing  the  control 
technology  required  to  make  possible  the  large,  shuttle-based  space  systems 
planned  for  the  1980s  and  beyond.  The  scope  of  this  workshop  encompassed  two 
mutually  complementary  themes,  both  of  which  involve  the  notion  of  a dis- 
tributed system  in  some  sense.  One  theme  was  the  control  theory  for  dis- 
tributed parameter  systems,  in  which  the  traditional  emphasis  is  on  developing 
basic  control  principles  by  means  of  distributed  or  continuum  models.  The 
other  theme  is  that  of  distributed  control  for  systems  requiring  spatially- 
distributed  multipoint  sensing  and  actuation  — whether  described  by  lumped 
or  continuum  models.  Papers  considering  both  of  these  control  theories  were 
presented  in  the  workshop  sessions  on  modeling  and  control,  control  and 
stabilization,  distributed  control,  control  theory  for  distributed  systems, 
and  estimation  and  identification.  The  three  discussion  sessions  held  were 
devoted  to  the  general  topics  of  a summary  of  LSS  control  problems,  LSS 
application  of  distributed  system  theory,  and  future  research  opportunities. 
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PREFACE 


At  the  time  this  workshop  was  being  formulated,  I was  the  manager  at  NASA 
Headquarters  responsible  for  a broad  area  of  electronics  and  human  factors 
research  and  technology  that  included  space  controls  and  guidance.  In  the  fall 
of  1980,  I suggested  to  Dr,  G.  Rodriguez  of  the  NASA  Jet  Propulsion  Laboratory 
and  Dr.  L.  W.  Taylor,  Jr.,  of  the  NASA  Langley  Research  Center  that  they  should 
organize  and  conduct  a workshop  on  this  topic.  My  motivation  and  rationale  are 
presented  in  the  following  paragraphs. 

In  reviewing  the  research  supported  by  my  office,  it  appeared  that  rela- 
tively little  work  was  being  done  in  the  application  of  partial  differential 
equation  (PDE)  control  theory  to  the  large  space  structure  control  problem.  It 
is  well  known  that  certain  large  spacecraft  configurations  with  rather  uniform 
mass  distribution  can  be  most  accurately  modeled  by  a set  of  PDF's.  Such  equa- 
tions, of  course,  are  much  more  difficult  to  treat  than  ordinary  differential 
equations  that  result  from  a finite-element  model  of  a structure.  For  the  most 
part,  researchers  who  have  used  PDE  models  have  made  some  approximations,  either 
in  the  formulation  or  in  the  solution  to  obtain  attractive  control  law  designs. 

Substantial  theory  existed  at  that  time  to  treat  control  of  PDE  systems. 

For  example.  Professor  A.  V.  Balakrishnan  at  UCLA  had  treated  PDE  control  for 
aircraft  flutter  and  had  discussed  the  problem  of  large  space  structure  control. 
It  was  not  clear  to  me  whether  or  not  an  exact  PDE  solution  would  be  better  than 
various  approximate  solutions  when  considering  practical  implementation  con- 
straints. However,  it  seemed  most  appropriate  for  the  NASA  organization  respon- 
sible for  advanced  controls  research  to  examine  the  state-of-the-art  theories 
and  applications  and  assess  whether  more  research  effort  should  be  applied 
towards  the  PDE  approach. 

My  suggestion  for  a workshop  was  to  bring  together  the  key  researchers  from 
both  schools  of  thought  and  discuss  the  status,  problems  and  potential  and  thus 
identify  important  research  opportunities.  The  workshop  that  resulted,  the 
workshop  reported  herein,  had  somewhat  broader  objectives  but  encompasses  my 
original  motivation.  The  main  objectives  of  the  workshop  were: 

1)  To  exchange  ideas  and  explore  the  application  of  various  control 
theories  for  distributed  systems  to  large  space  structures;  and 

2)  To  identify  the  important  unsolved  problems  of  current  interest  lead- 
ing to  possible  future  collaborative  NASA/university /industry  efforts. 

Here  the  term  distributed  systems  was  used  in  two  ways:  1)  to  mean  dis- 

tributed parameter  control  theory  leading  to  PDE  control;  and  2)  to  mean  control 
for  systems  with  spatially  distributed  multipoint  sensors  and  actuators,  whether 
modeleid  with  finite  elements  or  distributed  parameters. 

The  workshop  covered  the  topics  of  modeling  and  control,  control  and  stabil- 
ization, control  theory  for  distributed  systems,  and  estimation  and  identifica- 
tion. There  was  a formal  discussion  period  scheduled  at  the  end  of  each  of  the 
first  two  days  and  a wrap-up  panel  discussion  on  the  final  day.  The  final 
wrap-up  panel  discussion  focused  on  identifying  future  research  opportunities 
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for  NASA  consideration.  Technical  synopses  of  these  discussion  periods,  as  well 
as  a technical  evaluation  of  the  workshop,  are  included  in  this  document. 

One  final  note:  the  present  NASA  Administration  has  stated  publicly  that 
the  permanent  presence  of  man  in  space  is  NASA's  current  primary  goal  for  the 
space  program.  This  effort  should  result  in  the  first  U.S.  large  space  station, 
followed  by  a series  of  other  spacecraft,  platforms,  and  satellites  of  very  large 
dimensions.  Many  of  the  high-potential  space  applications,  both  civil  and  mili- 
tary, depend  on  having  the  technology  to  implement  large  space  structures.  The 
workshop  addressed  a very  timely  and  important  topic  for  the  future  exploitation 
of  space. 


H.  A.  Rediess 
October,  1982 
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MODELING  AND  CONTROL  OF 
DISTRIBUTED  STRUCTURES* 

L.  Meirovitch 

Department  of  Engineering  Science  and  Mechanics 
Virginia  Polytechnic  Institute  and  State  University 
Blacksburg,  VA  24061 


ABSTRACT 

There  appears  to  be  some  incongruity  in  the  design  of  controls  for  struc- 
tures. Structures  are  basically  distributed-parameter  systems,  described  by 
partial  differential  equations,  and  control  theory  is  concerned  almost  exclu- 
sively with  discrete  (in  space)'  systems,  described  by  ordinary  differential 
equations.  The  standard  approach  to  solving  this  dilemma  is  to  discretize  the 
system  in  space,  which  precludes  the  use  of  distributed  controls.  A different 
approach,  known  as  the  independent  modal-space  control  method,  is  designed  to 
eliminate  the  incongruity  by  bringing  about  a closer  correspondence  between 
modeling  and  control  theory.  Indeed,  the  independent  modal-space  control 
method  can  treat  distributed  structures  as  well  as  discretized  models  and  it 
permits  design  of  both  distributed  and  discrete-point  controls. 


INTRODUCTION 

Structures  are  essentially  distributed-parameter  systems  and  their  behav- 
ior is  described  by  partial  differential  equations  (Refs.  1,2).  The  diffi- 
culty in  designing  controls  for  distributed  structures  becomes  immediately 
evident  when  it  is  recognized  that  the  control  theory  is  concerned  for  the 
most  part  with  discrete  systems,  described  by  finite  sets  of  simultaneous 
ordinary  differential  equations,  and  not  with  distributed  systems.  Through  a 
modal  expansion,  it  is  possible  to  transform  the  partial  differential  equation 
governing  the  motion  of  a distributed  structure  into  a set  of  simultaneous 
ordinary  differential  equations,  but  the  set  of  equations  is  infinite,  so  that 
the  question  as  to  how  to  compute  control  gains  for  systems  of  infinite  order 
remains.  To  circumvent  this  difficulty,  an  approach  commonly  used  is  to  dis- 
cretize the  distributed  structure,  l.e.,  to  represent  it  by  a discrete  model 
of  finite  order.  This  approach  has  several  undesirable  effects.  In  the  first 
place,  the  control  design  tends  to  acquire  all  the  characteristics  of  a dis- 
crete system.  For  example,  distributed  controls  and  distributed  sensors  must 
be  ruled  out  immediately,  as  they  do  not  fit  in  a finite-dimensional  vector 
space  formulation  of  modern  control  theory.  Consistent  with  a discrete  (-in- 
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space)  theory,  one  must  control  structures  by  means  of  discrete-point  actua- 
tors and  sensors  (see,  for  example.  Refs.  3-7).  Another  drawback  is  that  a 
discretized  model  is  only  an  approximate  model,  so  that  if  discretization  of  a 
distributed  structure  is  not  done  with  care,  serious  errors  can  be  introduced, 
leading  to  an  unstable  system.  From  the  above,  we  conclude  that  the  choice  of 
the  mathematical  model  is  dictated  by  limitations  in  the  control  theory  and 
not  by  prudent  structural  modeling. 

More  often  than  not  control  of  discrete  (or  discretized)  systems  repre- 
sents one  form  or  another  of  modal  control  (Refs.  8,9).  The  idea  behind  modal 
control  is  that  one  can  control  the  structure  by  controlling  its  modes.  In 
theory,  one  can  control  all  the  modeled  modes  of  a discretized  system  with  a 
single  actuator  and  observe  all  these  modes  with  a single  sensor,  provided  the 
controllability  and  observability  requirements  are  satisfied.  In  practice, 
problems  can  arise.  Some  of  them  relate  to  the  control  system  performance  and 
other  to  the  computational  algorithms  for  the  control  gains.  Similar  problems 
exist  for  multi  input-multi  output  systems.  In  the  case  in  which  a discre- 
tized model  is  used  to  design  controls  for  a distributed  structure,  one  can 
encounter  problems  of  control  and  observation  spillover  (Ref.  3).  These 
problems  can  be  attributed  directly  to  the  Insistence  on  using  discrete  actu- 
ators and  sensors  to  control  a distributed  structure. 

It  is  standard  practice  in  modal  control  to  express  the  state  of  the 
system  in  terms  of  modal  coordinates  and  velocities,  in  which  case  the  formu- 
lation is  in  terms  of  the  so-called  modal  equations  of  motion.  In  the  case  of 
open-loop  controls  the  modal  equations  are  decoupled.  In  feedback  controls, 
however,  the  controls  depend  in  general  on  all  the  controlled  variables,  so 
that  the  feedback  controls  recouple  the  modal  equations  of  motion.  We  refer 
to  this  case  as  coupled  controls . The  approaches  used  in  Refs.  3-7  belong 
basically  in  this  category. 

Another  approach  to  modal  control,  knovm  as  the  Independent  modal-space 
control  method  (Refs.  10-13),  remains  more  faithful  to  the  original  structure. 
Indeed,  the  method  can  be  used  for  distributed  structures  or  for  discretized 
models  of  distributed  structures.  The  method  consists  of  designing  controls 
so  that  the  modal  equations  of  motion  remain  decoupled,  thus  reducing  the 
control  problem  to  the  design  of  modal  controls  for  a set  of  independent 
second-order  modal  equations.  Then,  the  actual  controls  are  synthesized  from 
the  modal  controls  via  a simple  transformation.  Because  the  controls  are 
designed  for  second-order  systems  only,  the  method  permits  a wider  choice  of 
control  techniques  and  most  of  the  problems  associated  with  coupled  controls 
disappear.  It  should  be  pointed  out  that,  in  the  case  of  the  independent 
modal-control  method,  if  one  chooses  to  work  with  a discretized  model,  then 
this  choice  is  dictated  by  the  inability  of  computing  the  entire  infinity  of 
modes  of  the  structure  and  not  by  the  inability  of  designing  distributed 
controls.  Of  course,  if  all  the  modes  of  the  structure  are  known,  then  no 
discretization  is  necessary.  The  independent  modal-space  control  method  is 
ideally  suited  for  distributed  controls,  in  which  case  no  control  spillover 
exists  (Ref.  13) . A similar  statement  can  be  made  regarding  observation 
spillover,  provided  one  is  able  to  measure  (or  estimate)  the  state  at  every 
point  of  the  distributed  structure. 
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This  paper  begins  with  the  derivation  of  the  equations  of  motion  for 
distributed  structures  and  follows  with  a discussion  of  proper  modeling,  with 
special  emphasis  on  the  case  in  which  the  eigenvalue  problem  can  be  solved 
only  approximately.  Some  fine  points  concerning  the  nature  of  the  approximate 
eigensolution  of  distributed  structures  are  presented  and  their  implications 
in  model  discretization  are  explored.  Then,  attention  is  given  to  various 
problems  inherent  in  the  control  of  distributed  structures  by  means  of  dis- 
crete models.  Finally,  the  independent  modal-space  control  method  is  reexam- 
ined in  the  context  of  control  of  distributed  structures. 


EQUATIONS  OF  MOTION  FOR  THE  STRUCTURE 

The  equations  of  motion  for  a distributed  structure  can  be  derived  by 
means  of  the  extended  Hamilton  principle  (Ref.  1 ).  This  requires  expressions 
for  the  kinetic  energy,  the  potential  energy  and  the  virtual  work.  Denoting 
the  displacement  vector  of  any  nominal  point  P in  the  structure  by  u(P,t), 
the  kinetic  energy  can  be  written  as 

T = ^ / mu'^u  dD  (1) 

D 

where  m = m(P)  is  the  mass  density  and  D is  the  domain  of  extension  of  the 
structure.  The  potential  energy  can  be  written  in  the  symbolic  form 

V = •|■[u,  u]  (2) 

where  [u,u]  denotes  an  energy  inner  product  (Ref.  2).  Moreover,  denoting  the 
distributed  force  vector  by  f(P,t),  the  virtual  work  has  the  expression 

6W  = I f"^  6u  dD  (3) 

D ~ 

where  6u  is  the  virtual  displacement  vector.  Note  that  in  the  case  of  feed- 
back controls,  the  force  vector  f does  not  depend  explicit|,y  ^n^the  position  P 
and  time  t but  only  implicitly  tfirough  the  state  vector  [u  u ] . 


The  extended  Hamilton  principle  has  the  form  (Ref.  1 ) 


/ (6L  + 6W)dt  = 0,  6u(P,t)  = 0 , t = t^,t2  ; FED 


(4) 


where  L = T-V  is  the  Lagrangian.  Inserting  Eqs.  (1-3)  into  Eq.  (4)  and  fol- 
lowing the  usual  steps,  we  obtain  the  partial  differential  equation  of  motion 
for  the  structure 
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(5) 


Lu  + Mil  = f , PeD 

where  L is  a differential  operator  matrix  with  entries  of  order  2p  and 
M is  a mass  matrix.  The  displacement  vector  u is  subject  to  the  boundary 
conditions 


Bj^u  = 0 , PeS;  i=l,2,...,p  (6) 

where  B.  are  differential  operator  matrices  with  entries  of  maximum  order  2p-l 
and  S is  the  set  of  points  defining  the  boundary  of  D. 

The  relatively  simple  formulation  (5-6)  may  be  a little  misleading.  In- 
deed, in  general  a structure  represents  an  assemblage  of  substructures  acting 
together  as  a whole,  so  that  the  operator  L may  vary  from  one  substructure  to 
another  and  the  operators  B,  may  vary  from  one  boundary  to  another.  It  should 
be  clear  that  control  of  a Jistributed-parameter  system  using  Eqs.  (5-6)  di- 
rectly is  not  possible,  even  for  a structure  of  moderate  complexity.  This 
points  to  modal  control  as  the  only  viable  alternative.  The  idea  behind  modal 
control  is  that  one  can  control  a structure  by  controlling  its  modes.  Before 
modal  control  can  be  implemented,  it  is  necessary  to  compute  the  modes  of  the 
structure. 


STRUCTURE  DISCRETIZATION 

As  mentioned  above,  the  problem  defined  by  Eqs.  (5-6)  in  general  is  too 
complex  to  permit  closed-form  solution.  This  applies  not  only  to  the  control 
problem  but  also  to  the  open-loop  eigenvalue  problem.  Hence,  we  seek  a solu- 
tion for  the  modes  of  the  structure  by  an  approximate  method,  which  requires 
the  discretization  of  the  structure.  To  this  end,  we  use  the  finite  element 
method,  which  can  be  regarded  as  a variant  of  the  Rayleigh-Ritz  method,  at 
least  in  the  case  of  structures. 

Let  us  assume  that  the  motion  of  the  structure  can  be  described  in  terms 
of  n nodal  coordinates  q.(t)  (j  = l,2,...,n).  In  general,  the  nodal  coordi- 
nates consist  of  both  translations  and  rotations.  Introducing  the  n-vector 
q(t)  = [q^^(t)  q2(t)  ...  q^(t)]  , the  displacement  vector  u(P,t)  can  be  expres- 

ied  in  terms  of  the  nodal  coordinates  by  means  of  the  linear  transformation 

u(P,t)  = L(P)q(t)  (7) 

where  L(P)  is  a matrix  of  interpolation  functions  (Ref.  2 ).  Introducing  Eq. 
(7)  into  Eqs.  (1-2),  we  obtain  the  discretized  kinetic  energy  and  potential 
energy  in  the  form 

T = j q^Mq  , V = | q\q  (8a, b) 


where 
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(9a, b) 


M=  /j^L^(P)ML(P)  dD(P),  K=  [L(P),  L(P)] 


are  n x n mass  and  stiffness  matrices.  Both  M and  K are  symmetric.  Moreover, 
M is  positive  definite  and  K is  nonnegative,  i.e.,  it  can  be  either  positive 
definite  or  positive  semidef inite,  the  latter  being  the  case  when  the  struc- 


ture  admits  rigid-body  modes.  In  addition,  inserting  Eq. 
obtain  the  discretized  virtual  work 

(7)  into  Eq.  (3) 

6W  =Q^6q 

(10) 

where 

Q = / L^’(P)f(P,t)  dD(P) 

(11) 

D 


is  a nodal  force  vector  and  6q  is  the  virtual  nodal  displacement  vector. 

The  extended  Hamilton  principle  remains  in  the  form  (4) , except  that  the 
varied  path  is  now  subject  to  5q  = 0 , t = t^^,  t^.  It  is  not  difficult  to 
show  that  use  of  the  principle, '"in  conjunction  with  Eqs.  (9)  and  (10),  yields 

Mq  + Kq  = Q (12) 

which  represents  a set  of  simultaneous  ordinary  differential  equations  of 
motion.  Hence,  the  effect  of  using  the  finite  element  method  is  to  reduce  a 
distributed  structure  with  an  infinite  number  of  degrees  of  freedom  to  a 
discrete  model  with  only  n degrees  of  freedom.  But,  whereas  Eq.  (12)  has  the 
appearance  of  a discrete  system  it  is  in  fact  only  a discretized  system  meant 
to  represent  a distributed  structure.  Hence,  the  process  leading  from  Eq.  (5) 
to  Eq.  (12)  is  a discretization  and  truncation  process  simultaneously.  Indeed 
it  should  be  very  clear  that  a discretized  model  of  a distributed  structure  is 
already  a truncated  model.  The  question  remains  to  what  extent  the  discre- 
tized model  is  representative  of  the  actual  distributed  structure.  We  propose 
to  examine  this  question  in  the  next  section. 


THE  NATURE  OF  THE  DISCRETIZED  MODEL 


As  pointed  out  above,  the  system  described  by  Eq.  (12)  is  not  truly  a 
discrete  system  but  only  a discretized  and  truncated  model  of  a distributed 
structure.  It  is  commonly  believed  that  by  increasing  n the  discretized  model 
becomes  a completely  faithful  representation  of  the  distributed  structure.  It 
turns  out  that  this  is  a mistaken  belief,  as  we  are  about  to  show. 

Let  us  consider  the  eigenvalue  problem  associated  with  Eq.  (12)  and  write 
it  in  the  form 


k(^)  (n)  ^ ^(n)  ^(n)  (n) 
r -r 


r=l,2, . . • , n 


(13) 
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where  ^ and  q ^ (r=l, 2 , . . . ,n)  are  eigenvalues  and  eigenvectors,  respec- 

tively. The  superscript  (n)  indicates  that  the  eigenvalue  problem  (13)  cor- 
responds to  a finite-dimensional  model  possessing  n degrees  of  freedom.  It  is 
not  difficult  to  show  that  all  the  eigenvalues  are  real  and  nonnegative.  Next, 
let  us  consider  a model  possessing  n + 1 degrees  of  freedom  and  denote  the 
associated  eigenvalue  problem  by  the  superscript  (n+1) , so  that 


j,(n+l)  (n+1) 

Sr 


_ ^ (n+1 )^ (n+1)  (n+1) 
^r  Sr 


r=l, 2, . . . ,n+l 


(14) 


The  question  arises  as  to  how  the  eigensolution  of  the  n-degree-of-freedom 
model  relates  to  the  eigensolution  of  the  (n+l)-degree-of-freedom  model. 

The  answer  is  provided  by  the  inclusion  principle  (Ref.  2 ).  Assuming  that 
the  two  sets  of  eigenvalues  are  arranged  in  ascending  order  of  magnitude. 


< A 


(n+1) 


n+1 


the  inclusion  prin- 


< and  . 

- n 1 - 2^^  - 

ciple  states  that  the  eigenvalues  A''  (r=l,2, . . . ,n+l)  bracket  the  eigen- 
values A^'^  (r=l,  2, . . . ,n) , or  ^ 


^(„+l)  ^ ^(n)  ^^(n+1)  ^ ^(n)  ^ 


^(n+1)  ^ ^(n)  ^ ^(n+1) 
n — n — n+1 


(15) 


The  eigenvalues  A^'^^  , A^'^^  , . . . , > computed  on  the  basis  of  the  dis- 

cretized model,  are  only  approximations  to  the  lowest  n eigenvalues  A^^,  X^, 
...,  A^  of  the  actual  structure.  Hence,  the  question  arises  as  to  how  the 

"computed  eigenvalues"  of  the  discretized  model  relate  to  the  "actual  eigen- 
values" of  the  distributed  structure.  In  this  regard,  it  can  be  stated  that 
(Ref.  2 ) 


lim  A^*^^  = A^,  r=l,2,...,n 

n “ 


(16) 


Moreover,  the  computed  eigenvalues  approach  the  actual  eigenvalues  from  above 
as  n tends  to  infinity. 


The  fact  that  the  computed  eigenvalues  approach  the  actual  eigenvalues 
asymptotically  as  n ->  «>  is  very  reassuring,  but  in  practice  n is  finite  and 
not  infinite.  As  it  turns  out,  when  n is  finite  the  situation  is  not  nearly 
as  good.  In  particular,  whereas  the  lower  computed  eigenvalues  tend  to  be 
relatively  good  approximations  to  the  actual  eigenvalues,  accuracy  deterio- 
rates as  the  mode  number  increases  to  the  extent  that  the  higher  computed 
eigenvalues  can  be  grossly  in  error,  sometimes  the  error  being  of  the  order  of 
several  hundred  percent.  Paradoxically,  this  is  true  regardless  how  large  n 
is.  Indeed,  increasing  ^ more  computed  eigenvalues  tend  to  become  accurate , 
but  at  the  same  time  new  computed  eigenvalues  are  added  at  the  upper  end  of 
the  spectrum,  and  the  new  ones  tend  to  be  wildly  in  error . Hence,  iw  dis- 
cretized model  can  yield  ^ totally  accurate  representation  of  a distributed 
structure.  This  fact  must  be  taken  into  account  in  designing  a control  sys- 
tem, as  the  ob.i ect  is  to  control  the  structure  Itself  and  not  the  model. 
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MODAL  EQUATIONS  OF  MOTION  FOR  CONTROL 

In  one  form  or  another,  virtually  all  approaches  to  the  control  of 
structures,  whether  distributed  or  discrete,  are  based  on  modal  control. 

The  idea  is  that  if  one  controls  the  modes  of  a structure,  then  in  essence 
one  controls  the  structure.  To  design  modal  controls,  it  is  convenient  to 
transform  the  nodal  equations  of  motion  into  modal  equations  of  motion. 

This  necessitates  the  structure  eigenvalue  and  eigenfunctions,  which  in  turn 
requires  the  solution  of  the  eigenvalue  problem  for  the  structure. 

More  often  than  not,  no  closed-form  solution  of  the  eigenvalue  problem 
exists,  so  that  an  approximate  solution  is  frequently  necessary.  This 
approximate  eigensolution  is  based  on  a discretized  system,  and  is  expressed 
in  fact  by  Eq.  (13).  Hence,  dropping  the  superscript  (n) , we  denote  the 
computed  eigenvalues  by  X and  the  computed  eigenvectors  by  q (r=l, 2 , . . . ,n) . 
The  computed  eigenvectors  can  be  used  to  determine  the  computed  eigenfunc- 
tions cj)(P)  associated  with  the  n lowest  modes  of  vibration  by  writing 

$^(P)  = L(P)q^,  r=l,2,...,n  (17) 


where  L(P)  is  the  matrix  of  interpolation  functions  of  Eq.  (7). 

Because  the  structure  is  self-adjoint,  the  eigenfunctions  (j)  (P)(r=l,2, 

. . . ,n)  are  orthogonal.  The  orthogonality  conditions  can  be  demonstrated  by 
recalling  that  the  eigenvalue  problem  (13)  is  defined  in  terms  of  two  real 
symmetric  matrices,  at  least  one  of  which  is  positive  definite.  The  impli- 
cation is  that  the  eigenvectors  q (r=l, 2, . . . ,n)  are  orthogonal  (Ref.  2). 
They  can  be  normalized  so  as  to  satisfy 


q^  Mq  = 6 , q'^Kq  = X 6 = 0)^6  , r,s=l,2, 

3s  3r  rs  3s  3r  r rs  r rs 


(18a, b) 


where  OJ  are  the  computed  natural  frequencies.  Then,  considering  Eqs.  (9a) 
and  (17^  one  can  write 


/ M(|)  dD  = / q^  L'^(P)ML(P)q  dD  = q^Mq  (19) 

so  that,  in  view  of  Eq.  (18a),  the  eigenfunctions  satisfy  the  orthonormality 
relations 


/ dD  = 6^^,  r,s  = l,2,...,n 


Moreover,  Eqs.  (9b)  and  (17)  permit  us  to  write 
[$g  , = qs[L(P),  L(p)]q^  = q\q^ 


(20) 


(21) 
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so  that,  in  view  of  Eq.  (18b),  the  eigenfunctions  also  satisfy  the  orthorma- 
lity  relations 


^$s’  " ^r^rs’  r,s=l,2, . . . ,n  (22) 


Observing,  however,  that  the  operator  L in  Eq.  (5)  is  related  to  the  energy 
inner  product  by 

/ v^iu  dD  = [v,  u]  (23) 

it  follows  that  the  orthormality  relations  (22)  can  be  replaced  by 


/ (j)  Lcj)  dD  = A 6 
•'^Xs  Xr  r rs 


r,s=l,2, • . . ,n 


(24) 


Note  that  the  relation  (23)  is  obtained  via  integrations  by  parts  during 
which  the  boundary  conditions  (6)  are  used  (Ref.  2 ). 

From  the  computed  eigenvalues  A and  computed  eigenfunctions  (f)  (P)(r=l,2, 
....,n)  only  a fraction  can  be  expecied  to  be  accurate  estimates  of  the  true 
eigenvalues  and  eigenfunctions  of  the  distributed  structure.  As  a rule  of 
thumb,  less'  than  one  half  of  the  computed  ones  are  accurate.  We  consider  the 
case  in  which  we  are  interested  in  a discretized  mathematical  model  with  N 
degrees  of  freedom  for  control  design,  so  that  we  must  insist  that  at  least  N 
computed  eigenvalues  and  eigenvectors  are  accurate.  This  implies  that  the 
algebraic  eigenvalue  problem  to  be  solved  must  be  of  order  n » N.  Then,  we 
wish  to  expand  the  displacement  vector  u(P,t)  into  the  series 

N 

u(P,t)  = E (j)  (P)u  (t)  (25) 

r=l  ~ 

where  u (t)  are  known  as  modal  coordinates.  Equation  (25)  is  known  as  the 
expansion  theorem  (Ref.  2).  Introducing  the  modal  matrix 


4)(P)  = [$^(P)  ••• 

and  the  modal  vector 

u(t)  = [Uj^(t)  U2(t)  ...  u^(t)]^  (27) 

Eq.  (25)  can  be  rewritten  in  the  matrix  form 

u(P,t)  = $(P)u(t)  (28) 
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The  modal  matrix  $(P)  permits  us  to  write  all  the  orthonormality  relations, 
Eqs.  (20)  and  (24),  in  the  compact  form 


/q  dD  = I,  dD  = A 


(29a, b) 


where  I is  the  unit  matrix  of  order  N and 


2 2 2 

A = diag  = diag[tOj^  . . . oo^]  (30) 

is  the  diagonal  matrix  of  eigenvalues,  ^^ere  A is  also  of  order  N.  Introducing 
Eq.  (28)  into  Eq.  (5),  multiplying  by  $ , integrating  over  the  domain  D and 
considering  Eqs.  (29)  , we  obtain  the  modal  equations 

u(t)  + Au(t)  = f(t)  (31) 

where 

f(t)  = [f^(t)  f2(t)...fj^(t)]^  = / f^’(P)f(P,t)  dD  (32) 

is  the  N-dimensional  modal  control  vector.  Note  that  the  components  of  the 
modal  control  vector  have  the  explicit  expressions 


f (t)  = / r(P)f(P,t)  dD  , r = 1,2 N 

D ~ 


(33) 


The  control  problem  consists  of  designing  a force  vector  f(P,t)  so  as  to 
drive  any  undesirable  disturbance  to  zero. 


COUPLED  MODAL  CONTROL 

Although  the  modal  equations  (31)  have  the  appearance  of  a set  of  indepen- 
dent equations,  they  are  in  fact  not  independent.  Indeed,  the  notation  f(t) 
for  the  modal  control  vector  is  misleading,  as  for  feedback  control  the 
vector  f depends  only  implicitly  on  time  and  it  depends  explicitly  on  the 
system  state,  or 

f = f(u(t),  u(t))  (34) 

In  the  general  case,  the  feedback  control  vector  f couples  the  modal  equations 
of  motion.  We  refer  to  this  case  as  coupled  modal  control.  In  using  a 
discrete  system  to  design  controls  for  a distributed  system,  the  controls 
acquire  discrete  characteristics.  As  a result,  it  is  impossible  to  design  a 
distributed  control  vector  f (P , t ) control  ^ system  in  coupled  form. 

Hence , ^ the  case  of  coupled  controls  we  must  use  discrete  actuators. 

In  view  of  the  above,  we  assume  that  the  control  is  implemented  by  means 
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of  M actuators  acting  at  the  discrete  points  P=P.  (j=l, 2, . . . ,M) . Discrete 
forces  can  be  treated  as  distributed  by  writing  ^ 


M 

f(P,t)  = S F (t)6(P-P  ) (35) 

j=l  ^ 


where  6(P-P.)  are  spatial  Dirac  delta  functions  and  F.(t)  are  actuator  force 
amplitude  victors.  Introducing  Eq.  (35)  into  Eq.  (32§,  we  obtain  the  modal 
controls 


M 

,(t)=  E 


M 


r(P)F  (t)6(P-P  ) dD=  E 
j = l D J j=l 


$^(P.)F.(t),  r=l,2, 


(36) 


and  we  observe  that  the  index  r in  Eqs.  (36)  has  no  upper  limit,  at  least  in 
theory.  This  implies  that  the  actuator  forces  F.  (j=l,2, . . . ,M)  generate  not 
only  modal  control  forces  f (r=l, 2 , . . . ,N)  acting  on  the  modeled  modes  but 
also  modal  forces  f (r=N+l,fi+2, . . . ) acting  on  the  higher  unmodeled  modes. 
This  fact  is  referred  to  as  control  spillover  into  the  unmodeled  modes.  The 
assumption  is  that  the  number  N ^ modeled  modes  is  sufficiently  large  that 
control  spillover  into  the  unmodeled  modes  is  insignificant . 

Introducing  the  actual  control  vector 


F(t)  = [F^(t)  F^(t)  ...  F^(t)]^  (37) 

and  the  modal  participation  matrix  for  the  modeled  modes 

[$^(Pjl)  <^^(P2)  •••  (38) 


the  modal  control  vector  f can  be  written  in  terms  of  the  actual  control 
vector  F in  the  form 

f(t)  = BF(t)  (39) 

so  that  the  modal  equations,  Eq.  (31),  become 

u(t)  + Au(t)  = BF(t)  (40) 

In  the  above  formulation,  it  is  assumed  that  there  are  N modeled  modes. 
Yet,  one  may  wish  to  control  only  a smaller  number  of  these  modes.  Hence,  we 
wish  to  distinguish  between  two  classes  of  modeled  modes,  namely,  controlled 
modes  and  uncontrolled  modes,  where  the  latter  are  referred  to  at  times  as 
residual  modes . We  denote  the  controlled  and  residual  modes  by  the  subscripts 
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C and  R,  respectively,  and  assume  that  there  are  controlled  modes  and  N„ 
residual  modes,  = N.  Hence,  Eq.  (40)  can  be  written  in  the  parti- 

tioned form 


1 

1 

1 

0 

~c 

+ 

0 

1 

“l 

1 

1 

^R 

L ^ J 

where  the  notation  is  self-evident. 

In  modal  control,  the  actual  feedback  control  vector  F is  assumed  to 
depend  on  the  modal  displacements  and  modal  velocities  corresponding  to  the 
controlled  modes  only,  or 


F = F (u^,  u^)  (42) 

The  force  vector  F can  be  determined  by  various  techniques,  such  as  pole 
allocation  (Refs.~8,9)  or  optimal  control  (Refs.  14,15).  We  discuss  these 
techniques  later  in  this  paper. 

Although  the  actuators  force  vector  F is  designed  so  as  to  regulate  the 
controlled  modes  only,  in  general  these  forces  will  excite  the  residual 
modes.  Indeed,  from  Eq.  (39),  we  can  write 

f^  = B^F  , f^  = B^F  (43a, b) 

Equation  (43b)  shows  clearly  that  there  are  modal  forces  acting  on  the 
residual  modes,  giving  rise  to  so-called  control  spillover  into  the  residual 
modes  (Ref.  3).  Some  attempts  have  been  made  to  suppress  the  residual  modes 
(Ref.  5),  but  questions  of  implementation  remain.  Note  that  to  eliminate 
this  spillover,  one  must  design  F so  that  f = 0. 

For  linear  controls,  the  actual  controls  are  assumed  to  be  proportional 
to  the  modal  displacements  and  velocities,  or 

~ ~ ^1~C  ^2~C 


where  and  are  control  gain  matrices  that  can  be  determined  by  one  of 
the  methods  mentioned  above.  This  permits  us  to  write  the  closed-loop  modal 
equations  in  the  form 


r-B_G.  : 

0 

+ 

1 

o 

H 

h 

--B„G,  I 

0 

-R 

R 1 [ 

^C_ 

_L 

*c-°c°2  ; ° 

u 

• 

u 

tR 


= 0 


(45) 
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It  is  clear  from  Eq.  (45)  the  closed-loop  poles  of  the  controlled  modes  are 
determined  by  ^nd  and  the  closed-loop  poles  of  the  residual 

modes  are  the  same  as  the  open-loop  poles  of  the  residual  modes,  so  that  the 
latter  are  not  affected  by  the  controls.  It  follows  that  control  spillover 
into  the  residual  modes  cannot  destabilize  these  modes.  Although  not  specif- 
ically included  here,  the  same  conclusion  is  valid  for  the  unmodeled  modes. 


CONTROL  IMPLEMENTATION  USING  OBSERVERS 

As  pointed  out  by  Eq.  (42),  or  Eq.  (44),  to  generate  the  actual  feedback 
forces,  one  must  know  the  modal  displacements  and  velocities  associated  with 
the  controlled  modes  at  all  times.  Sensors,  however,  measure  actual  displace- 
ments and  velocities,  so  that  the  question  remains  as  to  how  to  extract  the 
modal  displacements  and  velocities  from  actual  measurements.  It  is  customary 
to  assume  that  sensors  are  discrete  devices  producing  measurements  at  discrete 
points  in  the  structure.  Later  in  this  paper,  we  will  reexamine  this  assump- 
tion. 


Let  us  consider  the  case  in  which  K sensors  measure  displacements  and 
velocities  at  the  discrete  points  P=P^  (i=l,2, . . . ,K) . Denoting  the  sensors 
signals  by  y.(t),  we  can  write  for  displacements  and  velocity  measurements 


y (t)  = u(P  ,t),  y. (t)  = u(P. ,t),  i=l,2,...,K 

~J-  ~ i.  ~JL  ~ J. 


(46a, b) 


respectively.  Once  again  we  propose  to  ignore  the  unmodeled  modes  on  the 
grounds  that  their  contribution  to  the  overall  motion  is  insignificant,  so 
that  the  measurements  are  assumed  to  contain  contributions  from  the  controlled 
and  residual  modes  only.  Inserting  Eq.  (28)  into  Eqs.  (47),  we  obtain 

Yj_(t)  = $(P^)u(t),  y^^(t)  = f(P^)u(t),  1=1, 2,..., K (47a, b) 


where  'l(P.)  is  the  modal  matrix  evaluated  at  P=P.,  u(t)  is  the  modal  displace- 
ment vector  and  u(t)  is  the  modal  velocity  vector.  ^Introducing  the  matrix 


4>(P2> 

♦<p^) 


‘A  i SI 


(48) 


where  the  partitioning  is  obvious,  as  well  as  the  measurement  vector 


y(t)  = [y^(t) 


T 


y^(t)  ...  y^(t)] 


(49) 


Eqs.  (47)  can  be  rewritten  as 


y(t)  = Cu(t)  = C^u^(t)  + Cj^Uj^(t),  y(t)  = Cu(t)  = (t)+Cj^Uj^(t) 

(50a, b) 

For  feedback  control,  we  only  need  and  u^(t),  which  can  be  extracted 

from  measurements  of  actual  displacements  and  velocities  by  means  of  an 
observer,  or  state  estimator  (Ref.  16).  To  this  end,  it  is  convenient  to  re- 
write the  equations  of  motion  in  state  form.  Hence,  let  us  define  the  con- 
trolled and  residual  modal  state  vectors 


v^ 


T I T X 

flic 


.X  t x.T 

Yr  = tl^R  : ^r] 


(51a, b) 


the  associated  modal  control  vectors 


V, 


[fj  |0^]^ 


T I T 

!r  = [!r  ! 2 ^ 


the  coefficient  matrices 


1 
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1 

1 

1 

1 

O 

1 

A = 

1 

1 
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1 

"^R  1 

o 

1 

1 

M 

1 

1 

1 o 

1 

where  I are  identity  matrices  of  appropriate  order,  and 


f 

R — 

1 

pi\ 
pq  1 
1 ! 

0 

’ \ 

. 0 J 

Then,  Eq.  (41)  can  be  rewritten  in  the  state  form 


(52a, b) 


(53a, b) 


(54a, b) 


(55) 


as  indicated  by  Eq.  (42),  the  actual  control  vector  F depends  on  the  controlled 
state,  or 


(56) 
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For  linear  control,  we  obtain  from  Eq.  (44) 


F = 


(57) 


where 


Gj  ■ [Gi  i G^J  (58) 

is  the  control  gain  matrix.  The  purpose  of  the  observer  is  to  extract  the 
state  vector  v^  from  the  system  output,  namely,  the  measurement  vector  y. 
However,  the  output  vector  y contains  contributions  not  only  from  the  con- 
trolled state  V but  also  from  the  residual  state  v_,  so  that 


y(t)  = c'v  + c'v  (59) 

T t 

where  and  C are  matrices  representing  combinations  of  and  respec- 
tively, and  null  matrices,  depending  on  the  type  of  measurements  used. 


Observers  are  dynamical  systems  similar  to  the  actual  systems  and  receiv- 
ing as  input  the  output  of  the  actual  system.  The  observer  eigenvalues  are 
chosen  so  that  the  observer  state  approaches  the  state  of  the  actual  system 
asymptotically.  The  term  actual  system  must  be  interpreted  here  as  the 
discretized  model  and  not  the  distributed  system.  Observers  can  be  designed 
in  various  forms,  but  their  main  features  are  the  same  for  all  designs.  We 
choose  the  observer  in  the  form 


1 

> 

o 

o 

1 

^<J  >' 

O 

rt 

1 

_1_ 

1 

o 

. > 



_«R 

/X  J I 

y(t)  = Cj.v^(t)  + Cj^v^(t) 


F(t) 


[y(t)  - y(t)] 


(60a) 


(60b) 


where  v and  v_  are  estimates  of  v„  and  v_,  respectively,  K and  K_  are  obser- 

~X\.  ~xv  C>  K.  /s 

ver  gain  matrices,  chosen  so  that  v approaches  v exponentially,  and  y 

is  the  observer  output.  An  observer  of  the  type  (60)  is  known  as  a Luenberger 
observer  (Ref.  16).  Equations  (60)  are  similar  to  Eqs.  (13)  of  Ref.  3,  with 
the  exception  that  here  the  residual  modes  are  Included  in  the  observer 
dynamics  and  system  output,  whereas  they  are  left  out  in  Ref.  3.  Excluding 
the  residual  modes  from  the  observer  dynamics  and  system  output  is  equivalent 
to  treating  them  as  unmodeled.  We  examine  the  implications  of  this  shortly. 


In  the  feedback  control  vector  F depends  on  the  controlled  state  v . 
However,  the  observer  produces  the  estimated  state  v and  not  the  actual 
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Eqs.  (47)  can  be  rewritten  as 


y(t)  = Cu(t)  = C^u^(t)  + Cj^u^(t),  y(t)  = Cu(t)  = C^u^(t)+Cj^Uj^(t) 

(50a, b) 

For  feedback  control,  we  only  need  u^(t)  and  u (t) , which  can  be  extracted 
from  measurements  of  actual  displacements  and  velocities  by  means  of  an 
observer,  or  state  estimator  (Ref.  16).  To  this  end,  it  is  convenient  to  re- 
write the  equations  of  motion  in  state  form.  Hence,  let  us  define  the  con- 
trolled and  residual  modal  state  vectors 


T I t,T 


T I t.T 

~X\.  I ~i\ 


(51a, b) 


the  associated  modal  control  vectors 


TO  T T 

V = [f^  ; o"]"- 


IR 


T I T 


(52a, b) 


the  coefficient  matrices 
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where  1 are  identity  matrices  of  appropriate  order,  and 


(53a, b) 


«C  = 


1 

1 

1 td 
1 r\ 
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1 

Fb_  1 

■Q  — 

I 

DCJl 
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I 

L oj 

1 

I 

o l 

I 

1_ 

(54a, b) 


Then,  Eq.  (41)  can  be  rewritten  in  the  state  form 


(55) 


as  indicated  by  Eq.  (42),  the  actual  control  vector  F depends  on  the  controlled 
state,  or 


(56) 
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For  linear  control,  we  obtain  from  Eq.  (44) 


F = 


(57) 


where 

Gc  ■ [=1  i «2l  (58) 

is  the  control  gain  matrix.  The  purpose  of  the  observer  is  to  extract  the 
state  vector  v^  from  the  system  output,  namely,  the  measurement  vector  y. 
However,  the  output  vector  y contains  contributions  not  only  from  the  con- 
trolled state  V but  also  from  the  residual  state  v_,  so  that 


y(t)  = c'v  + c'v  (59) 

Vo  ~ Vj  ^ 

f t 

where  and  C are  matrices  representing  combinations  of  and  respec- 
tively, and  null  matrices,  depending  on  the  type  of  measurements  used. 


Observers  are  dynamical  systems  similar  to  the  actual  systems  and  receiv- 
ing as  input  the  output  of  the  actual  system.  The  observer  eigenvalues  are 
chosen  so  that  the  observer  state  approaches  the  state  of  the  actual  system 
asymptotically.  The  term  actual  system  must  be  interpreted  here  as  the 
discretized  model  and  not  the  distributed  system.  Observers  can  be  designed 
in  various  forms,  but  their  main  features  are  the  same  for  all  designs.  We 
choose  the  observer  in  the  form 


yc(t) 

O 

c 
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ft 
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O 1 
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^R(t) 

Yr(8)_ 

/r^ 

/r 

[y(t)  - y(t)] 


(60a) 


y(t)  = C^v^(t)  + Cj^Vj^(t) 


(60b) 


/S 

where  v and  v_  are  estimates  of  v and  v_,  respectively,  K and  K are  obser- 
~K.  -n/L»  ~K.  U K. 

ver  gain  matrices,  chosen  so  that  v^  approaches  v^^  exponentially,  and  y 

is  the  observer  output.  An  observer  of  the  type  (60)  is  known  as  a Luenberger 
observer  (Ref.  16).  Equations  (60)  are  similar  to  Eqs.  (13)  of  Ref.  3,  with 
the  exception  that  here  the  residual  modes  are  included  in  the  observer 
dynamics  and  system  output,  whereas  they  are  left  out  in  Ref.  3.  Excluding 
the  residual  modes  from  the  observer  dynamics  and  system  output  is  equivalent 
to  treating  them  as  unmodeled.  We  examine  the  implications  of  this  shortly. 

In  the  feedback  control  vector  F depends  on  the  controlled  state  v . 
However,  the  observer  produces  the  estimated  state  v and  not  the  actual 
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observability  matrix 


0 = 


,T 


I rp  I 

I I 

• I 


(68) 


we  can  state  the  theorem:  For  the  state  ^ ^ completely  observable , it 

is  necessary  and  sufficient  that  the  matrix  observability  matrix  0 be  of 
rank  2N  (Ref.  16).  In  the  case  under  consideration,  the  observability  matrix 
reduces  to 


r^T  I 

, T 

1 

0 

1 0 

1 

0 i 

_ II 
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1 

1 

1 

A^c 

I 
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N 
' A ' 


(69) 


Hence,  the  state  v^  ^ completely  observable  if  no  column  of  ^ zero. 
Physically,  this  implies  that  lack  of  observability  can  occur  only  if  all  the 
sensors  are  located  at  the  nodal  points  of  a given  mode.  Clearly,  the  problem 
of  observability  disappears  as  the  number  of  sensors  increases . 


MODAL  FILTERS  (IN  SPACE) 

As  shown  earlier,  the  purpose  of  a Luenberger  observer  is  to  generate  an 
estimate  v^^  of  the  modal  state  v^,  from  the  system  output.  But  a Luenberger 
observer  is  strictly  a discrete^system  device,  which  does  not  consider  the 
distributed  nature  of  a structure.  In  particular,  by  writing  the  output  in 
the  form  (50),  no  advantage  is  taken  of  the  structure  characteristics,  as 
reflected  in  the  orthogonality  of  modes.  It  turns  out  that,  by  taking  advan- 
tage of  the  orthogonality  of  modes,  the  modal  state  can  be  estimated  in  a 
more  direct  way,  thus  obviating  the  need  for  a Luenberger  observer.  Or,  by 
measuring  displacements  alone,  one  can  use  the  same  direct  procedure  to 
estimate  the  modal  displacements  and  then  use  a Luenberger  observer  to  esti- 
mate the  modal  velocities. 

Equation  (25),  expanding  the  displacement  vector  u(P,t)  in  a series  of 
the  structure  eigenfunctions  multiplied  by  modal  coordinates,  is  only  the 
first  part  of  the  expansion  theorem.  The  second  part  of  the  expansion  theorem 
relies  on  the  orthonormality  of  modes,  Eqs.  (20),  and  permits  the  computation 
of  the  modal  coordinates  from  the  actual  displacements,  as  follows: 

u„(t)  = / $3(P)Mu(P,t)  dD,  u^(t)  = / $^(P)Mu(P,t)  dD,  r=l,2,...(70a,b) 

r D"“r  - 

Equations  (70)  filter  out  the  modal  coordinates  u (t)  and  modal  velocities 
u (t)  (r=l,2,...)  from  measurements  of  the  actual  displacement  vector  u(P,t) 
and  actual  velocity  vector  u(P,t)  at  every  point  P of  the  structure  and  for 
all  times  t.  Hence,  they  can  be  regarded  as  (spatial)  modal  filters . It 
should  be  pointed  out  that  in  the  case  of  distributed  measurements,  as  postu- 
lated in  Eqs.  (70),  observability  is  guaranteed  by  definition.  Indeed, 
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observability  is  a concept  peculiar  to  discrete  sensors,  particularly  when 
their  number  is  small. 

In  the  case  in  which  distributed  measurements  are  not  available,  we  must 
consider  the  problem  of  estimating  u^(t)  and  u^(t)  (r=l,2 , . . . ,N^)  from  discrete 
measurements.  To  this  end,  let  us  assume  that  there  are  K sensors  capable  of 
measuring  displacements,  velocities,  angular  displacements  and  angular  veloci- 
ties at  the  discrete  points  P = (1-1,2, ... ,K) . Then,  the  question  can  be 

posed  simply  as  to  thg  number  of  measurements  necessary  to  generate  accurate 
estimates  u(P,t)  and  d(P,t)  of  u(P,t)  and  u(P,t).  This  is  not  a new  question, 
as  the  question  has  been  asked  Jrequently  In  the  finite  element  method.  In 
particular,  the  question  is  to  the  number  of  finite  elements  that  must  be 
used  to  approximate  the  displacement  profile  of  a certain  distributed  struc- 
ture, given  a set  of  interpolation  function  and  the  values  of  the  displacement 
function  at  the  boundaries  of  the  finite  elements.  Of  course,  the  number 
depends  on  the  type  of  functions  to  be  approximated,  the  type  of  interpolation 
functions  and  the  desired  accuracy.  In  our  case,  we  wish  to  generate  approxi- 
mate functions  u(P,t)  and  u(P,t)  of  sufficient  accuracy  to  permit  accurate 
estimates  u (t)~and  u (t)  of  the  first  modal  displacements  u^(t)  and  modal 
velocities  u (t) . Hence,  in  the  case  of  discrete  measurements,  the  modal 
filters  can  £e  written  in  the  form 


u^(t)  = / u(P,t)dD,  u (t)  = / $^(P)M  u(P,t)dD,  r=l,2,...,N 


(71a,b) 


To  illustrate  the  procedure,  let  us  assume  that  the  domain  D is  divided  into 
K-1  elements  D. (1=1,2 ,... ,K-1)  and  denote  the  displacement  and  velocity  vec- 
tors in  D by  u^(P,t)  and  u(P,t).  Moreover,  denoting  the  vector  of  nodal 
displacement  measurements  at  the  boundaries  of  by  y^(t) , where  y^(t) 
includes  translational  and  rotational  coordinates,  we  can  write 

u^(P,t)  = L^(P)y^(t),  u^(P,t)  = J(P)y^(t),  P e D^,  i=l,2 , . . . ,K-1 


(72a, b) 

where  L(P)  is  a matrix  of  interpolation  functions  of  the  same  type  as  that 


used  in  Eq 

. (7). 

Then,  Eqs.  (71)  can 

be  replaced  by 

/S 

K-1 

m 

K-1 

u (t) 
r 

= E 
i=l 

/ 4>^(p)Mu  (p,t)dD  = 

D.~ 

1 

S , r=l,2, . . . 

(73a) 

K-1 

rri  ■J' 

K-1  T . 

u (t) 
~r 

= E 
i=l 

f $ (P)Mu. (P,t)dD  = 

D_~r 

1 

^ JriXi^*^^  ’ r=l,2, . . . ,N^ 
i=l"' 

(73b) 

where 
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(74) 


I^i  = I $^(P)ML^(P)dD 

Di  ~ 

are  integrals  that  can  be  computed  off-line.  Hence,  the  entire  process  of 
estimating  modal  coordinates  and  velocities  has  been  reduced  to  simple 
summation  of  vector  products.  These  estimates  can  be  produced  almost  instant- 
aneously, as  soon  as  the  measurements  become  available. 

It  is  perhaps  appropriate  to  point  out  that  the  use  of  discrete  measure- 
ments in  conjunction  with  the  interpolation  scheme  described  above  results  in 
estimated  measurements  that  must  be  regarded  as  distributed.  Hence,  observa- 
bility of  the  first  modes  is  virtually  guaranteed. 

As  mentioned  earlier,  if  only  one  type  of  measurement  is  available,  such 
as  displacements  only  or  velocities  only,  then  one  can  use  independent 
second-order  modal  Luenberger  observers  to  estimate  the  other  type  of  variables 
needed  to  complete  the  estimate  v of  the  state  v„. 

~ L/ 

A comparison  between  Luenberger  observers  and  modal  filters  appears  in 
order.  Clearly,  Luenberger  observers  must  be  used  when  only  a limited  number 
of  sensors  is  available.  In  fact,  Luenberger  observers  can  be  used  in  con- 
junction with  a single  sensor.  Questions  of  observability  arise,  however,  as 
the  location  of  the  single  sensor  can  be  critical.  Then,  there  is  the  ques- 
tion of  selecting  observer  gains,  for  which  there  are  no  good  criteria.  More- 
over, convergence  requires  a little  time,  depending  on  the  observer  poles. 

By  contrast,  modal  filters  require  a larger  number  of  sensors,  the  number 
depending  on  the  highest  mode  targeted  for  control,  as  the  number  must  be  such 
as  to  permit  reasonably  accurate  finite  element  approximation  to  that  mode. 

If  the  number  of  sensors  presents  no  objections , modal  filters  are  likely  to 
be  far  superior  to  Luenberger  observers , as  they  represent  ^ concept  that 
takes  full  advantage  of  the  dynamic  characteristics  of  the  distributed 
structure.  Indeed,  modal  filters  screen  out  quite  efficiently  contaminations 
from  the  higher  uncontrolled  modes.  The  reason  for  this  lies  in  the  orthogon- 
ality property,  so  that  higher  modes  are  filtered  out  regardless  whether  they 
are  known  or  not.  Hence,  clear  advantages  of  modal  filters  are  that  they  are 
virtually  free  of  observation  spillover  and  that  the  relatively  large  number 
of  sensors  used  precludes  questions  of  observability . Finally,  modal  filters 
involve  only  simple  on-line  operations,  such  as  additions  and  multiplications, 
so  that  they  provide  virtual  instantaneous  estimation  of  the  modal  coordinates 
and  velocities  required  for  feedback. 


INDEPENDENT  MODAL-SPACE  CONTROL  (IMSC) 

The  independent  modal-space  contract  method  is  basically  a method  for  con- 
trolling structures,  or  any  other  system  admitting  independent  modes.  It  can 
be  used  for  distributed  systems,  for  discretized  models  of  distributed  systems, 
or  for  purely  discrete  systems  (if  Indeed  such  systems  do  exist) . When  used 
for  discretized  models,  it  is  for  entirely  different  reasons  than  in  the  case 
of  coupled  controls.  Indeed,  in  coupled  controls  a finite-dimensional  model 
must  be  used  because  the  control  gain  matrix  must  have  finite  dimensions.  On 
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the  other  hand,  the  main  consideration  in  choosing  a discretized  model  in  the 
IMSC  is  whether  a closed-form  eigensolution  for  the  distributed  structure  exists 
or  not.  Moreover,  in  the  IMSC  method  it  is  possible  to  design  distributed 
controls,  whereas  in  coupled  controls  it  is  not.  It  will  prove  convenient  to 
divide  the  discussion  of  the  IMSC  method  according  to  the  type  of  actuators 
used. 


i.  Control  by  distributed  actuators 

Let  us  first  assume  that  a closed-form  eigensolution  for  the  distributed 
structure  exists,  so  that  the  entire  infinity  of  modes  is  known,  and  rewrite 
Eqs.  (31)  as  an  infinite  set  of  second-order  differential  equations  of  the 
form 


^ 2 

U + CO  u 

r r r 


r=l,2, 


(75) 


Note  that  ^ this  case  the  entire  infinity  of  modes  is  modeled,  and  there  are 
no  unmodeled  modes . 

In  coupled  controls,  the  modal  feedback  controls  f depend  in  general  on 
all  the  controlled  modal  coordinates  and  velocities,  so  that  in  such  a case  the 
feedback  controls  recouple  the  equations.  Here,  we  wish  to  consider  the  case 
in  which  the  feedback  controls  have  the  special  form 


fj.  = ’ r=l,2,...  (76) 

Because  the  modal  control  for  the  rth^mode  depends  only  on  the  rth  modal  coor- 
dinate u and  the  rth  modal  velocity  u , Eqs.  (75)  become  completely  decoupled, 
i.e. , they  become  both  internally  (plant-)  and  externally  (controller-)  de- 
coupled. Hence,  in  this  case  Eqs.  (75)  represent  an  infinite  set  of  independent 
modal  equations.  The  essence  of  the  independent  modal-space  control  (IMSC) 
method  is  to  design  modal  controls  of  the  type  (76) . There  is  no  restriction  on 
the  form  of  f , so  that  the  modal  controls  f can  be  linear  or  nonlinear  func- 
tions of  the  modal  coordinate  u and/or  modal  velocity  u^*  In  coupled  controls 
one  desTgns  the  actual  controls’^dlrectly . By  contrast,  in  the  IMSC  method  one 
designs  first  the  modal  controls  and  then  synthesizes  the  actual  controls  from 
the  modal  controls.  Indeed,  in  the  case  of  distributed  controls  one  can  synthe- 
size the  actual  control  force  f(P,t)  from  the  modal  controls  f by  means  of  the 
formula 


00 

f(P,t)  = E M(P)$  (P)f  (t)  (77) 

r=l 

In  using  the  IMSC  method,  one  can  control  the  entire  infinity  of  modes  or 
only  a finite  number  of  modes.  If  only  N modes  are  to  be  controlled,  then  one 
can  simply  take  f ^0  for  r=l,2,...,N^  and  f =0  for  r=N^  + 1»  + 2,.... 

Clearly,  because  all  modes  are  modeled,  there  are  only  two  classes  of  modes , 
controlled  and  uncontrolled . Moreover,  because  f =0  for  r = + 1,  + 2, 

. . . , there  is  no  control  spillover  into  the  uncontrolled  modes . 
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It  must  be  pointed  out  that  ^ system  subject  to  distributed  controls  is 
completely  controllable  by  definition.  Indeed,  controllability  is  strictly  a 
discrete  controls  concept  having  no  counterpart  in  distributed  controls. 


Next,  let  us  consider  the  question  of  modal-space  control  implementation. 
In  the  case  of  linear  controls,  the  modal  controls  can  be  taken  as 


f 

r 


G u + G -u  , r=l,2, . 
rl  r r2  r 


(78) 


where  G and  G ^ (r=l,2,...)  are  modal  gains , ordinarily  taken  as  real  nega- 
tive scalars,  fn  the  case  of  nonlinear,  on-off  controls,  the  modal  controls 
have  the  form 


f 

r 


u > d ; 
r r 


0, 


< d ;k  , 
I r r ■ 


r=l,2 , . . . 


(79) 


where  k is  a control  gain  parameter  and  2d  is  the  magnitude  of  the  deadband 
region.  Then,  the  actual  distributed  controls  are  synthesized  from  the  modal 
controls  by  Inserting  Eq.  (78)  or  Eqs.  (79)  into  Eq.  (77).  Note  that  in  the 
case  of  on-off  modal  controls,  if  a finite  number  of  modes  are  controlled, 
then  the  actual  control  of  any  point  P is  quantized,  i.e.,  it  has  the  form  of 
a staircase  in  time. 

The  control  of  discretized  structures  by  distributed  actuators  is  very 
similar  to  the  control  of  distributed  structures.  The  only  difference  is  that 
the  modes  corresponding  to  r > N are  not  known  with  sufficient  accuracy.  In 
this  case,  it  is  not  possible  to  control  the  entire  infinity  of  modes.  Hence, 
if  only  modes  are  to  be  controlled,  < N,  then  the  synthesis  can  be 

carried  out  simply  by  using  the  formula 

f(P,t)  = E M(P)$^(P)f^(t)  (80) 

r=l 


But,  as  shown  by  Eq.  (77),  distributed  controls  can  accommodate  an  infinity  of 
modal  coordinates  and  not  just  N^.  Taking  f =0  for  r = + 1,  + 2 , . . . , 

formulas  (77)  and  (80)  become  interchangeable,  so  that  it  does  not  matter 
whether  the  modes  corresponding  to  r > N are  known  or  not.  But,  because  dis- 
tributed controls  of  the  type  (80)  can  be  interpreted  to  mean  that  f =0  for 
r > N^,  it  can  be  concluded  that  once  again  control  spillover  into  tfie  uncon- 
trolled modes  does  not  exist.  As  in  the  case  of  distributed  structures,  in 
the  case  of  discretized  models  controlled  by  distributed  actuators  control- 
lability of  the  controlled  modes  is  guaranteed. 


ii.  Control  by  discrete  actuators 


Distributed  controls  have  many  advantages  over  discrete  controls.  In  many 
cases,  however,  distributed  controls  may  not  be  feasible,  so  that  one  must 
consider  control  implementation  by  discrete  actuators. 
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In  the  case  of  discrete  actuators,  only  a finite  number  of  modes  can  be 
controlled.  Let  us  assume  that  N modes  are  to  be  controlled,  which  makes  the 
infinity  of  remaining  modes  uncontrolled.  The  task  is  to  be  carried  out  by 
means  of  M discrete  actuators.  As  shown  earlier,  discrete  forces  can  be 
treated  as  distributed  by  using  Eq.  (35).  Denoting  the  vector  of  controlled 
modes  by  u^  and  the  vector  of  modal  control  forces  by  f and  using  Eq.  (43a) , 
the  modal  equations  for  the  controlled  modes  can  be  written  in  the  form 

+ Vc  = !c  = ®c! 


where  F is  the  M-vector  of  actuator  forces.  As  pointed  out  earlier,  in  the 
IMSC  method  one  designs  first  the  modal  control  vector  f^  and  then  synthesizes 
the  actual  control  vector  F from  f^.  From  the  right  side  of  Eq.  (81),  the 
synthesis  amounts  to  writing 

! ^ ®c!c  (82) 


where  is  the  pseudo-inverse  of  B . However,  pseudo-inverses  can  be  quite 
inaccurate.  It  is  possible  to  avoid  the  use  of  pseudo-inverses  by  taking  the 
dimension  of  F to  be  the  same  as  the  dimension  of  f^,  M = N^,  which  implies 
that  the  number  of  actuators  must  be  the  same  as  the  number  of  controlled 
modes . In  this  case,  B^  is  a square  matrix,  so  that  Eq.  (82)  reduces  to 


! ■ »c^!c 


(83) 


Note  that  the  controllability  of  u^  guaranteed  when  using  IMSC , because  one 
has  always  the  freedom  of  placing~the  actuators  in  a way  that  B ^ nonsingu- 
lar. 


Next,  let  us  examine  the  spillover  problem.  Denoting  the  infinite-dimen- 
sional vector  of  uncontrolled  modes  by  u^,  the  modal  equations  for  the  uncon- 
trolled modes  can  be  written  as 

• 

where  the  notation  is  obvious.  Whereas  f^^  depends  on  and  u^,  this  does  not 
imply  coupling  between  the  controlled  and  the  uncontrolled  modes.  Indeed,  as 
far  as  the  uncontrolled  modes  are  concerned,  the  vector  f^  is  a known  function 
of  time,  so  that  Eq.  (84)  clearly  indicates  that  control~spillover  into  the 
uncontrolled  modes  does  exist , but  because  the  frequency  components  of  f^ 
are  entirely  different  from  the  natural  frequencies  is  A^,  no  instability  due 
resonance  is  possible.  Because  in  IMSC  one  has  the  freedom  of  choosing  the 
location  of  the  actuators,  spillover  can  be  reduced  by  placing  them  so  as  not 
to  excite  critical  modes.  Moreover,  because  in  IMSC  control  of  modes  are 
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designed  independently  for  every  mode,  it  is  very  easy  to  control  any  other 
mode  that  may  be  critical  due  to  external  factors,  no  matter  how  high  the  mode 
is.  For  example,  one  is  able  to  control  the  first  four  modes  and  the  twelth 
mode,  without  controlling  any  intermediate  mode  or  modes  higher  than  the 
twelfth. 

It  should  be  pointed  out  that  control  spillover  into  the  uncontrolled 
modes  always  exists  when  using  discrete  actuators , and  this  is  true  regardless 
of  the  method  of  control.  The  only  way  control  spillover  can  be  eliminated  is 
through  the  use  of  distributed  controls,  and  this  is  possible  only  in  conjunc- 
tion with  IMSC  and  not  with  coupled  controls.  It  must  also  be  pointed  out 
that  the  control  spillover  is  not  as  serious  a problem  as  it  may  seem.  In  the 
first  place,  for  most  realistic  structures  higher  modes  are  extremely  difficult 
to  excite,  as  this  requires  large  amounts  of  energy.  In  addition,  all  struc- 
tures have  a certain  amount  of  inherent  damping,  which  tends  to  attenuate  higher 
modes  at  a higher  rate  than  lower  modes . 

Control  of  discretized  structures  by  discrete  actuators  is  carried  out  in 
exactly  the  same  manner  as  the  control  of  distributed  structures  by  discrete 
actuators  discussed  above.  Indeed,  as  long  as  only  N are  to  be  controlled, 
and  these  modes  are  known  accurately,  it  does  not  matter  whether  the  remaining 
modes  are  known  or  not. 


COMPUTATIONAL  ALGORITHMS  FOR  CONTROL 

Wtien  attempting  to  control  structures  by  techniques  of  modern  control, 
one  must  be  struck  by  the  fact  that  structures  are  basically  distributed 
systems  and  modern  control  theory  is  concerned  with  discrete  systems.  To 
circumvent  this  problem,  the  standard  approach  has  been  to  discretize  the 
structure,  which  leads  to  a problem  of  a different  kind.  Complicated  struc- 
tures require  discretized  models  of  relatively  high  order  and  computational 
algorithms  work  well  for  low-order  systems  but  experience  serious  difficulties 
for  high-order  systems.  This  is  the  situation  in  the  case  of  coupled  controls. 
Because  the  IMSC  method  reduces  the  control  problem  to  a set  of  independent 
second-order  systems,  its  advantages  should  become  immediately  obvious.  Not 
only  is  the  IMSC  unlikely  to  experience  computational  difficulties  with  discre- 
tized systems,  but  it  is  also  able  to  design  controls  for  distributed  struct- 
ures, if  all  its  modes  are  knoxm. 

There  are  two  commonly  used  techniques  for  computing  the  control  gains  in 
the  case  of  linear  controls,  namely,  the  pole  allocation  method  and  optimal 
controls . The  methods  are  discussed  in  the  following  in  the  context  of  coupled 
controls  and  IMSC. 

1.  Pole  allocation 


In  the  pole  allocation  method,  the  closed-loop  poles  Pj^(k=l,2, . . . ,2N^)  of 
the  controlled  modes  are  selected  in  advance,  and  then  the  control  gains  are 
computed  so  that  the  coefficient  matrix  of  the  closed-loop  system  possesses 
eigenvalues  equal  to  the  selected  poles.  First,  we  concentrate  on  coupled 
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controls.  The  application  is  relatively  simple  for  single  inputs.  In  this 
case,  the  state  equation  for  the  controlled  modes  reduces  to 


Yc  ==  ^cYc  + 


(85) 


It  is  shown  in  Ref.  9 that  the  single  input  F(t)  can  be  expressed  in  the  form 


F(t)  = g'^v^(t) 


(86) 


where  the  gain  vector  g has  the  expression 


2N  TT^l 

E >^-1 

(87) 

II 

!-■ 

S 

2N 

Yj^c 

in  which  v^  are  the  left  eigenvectors  of  and  are  the  open-loop  poles. 

In  the  case  of  multi- input  controls,  one  must  solve  a set  of  2N^  nonlinear 
algebraic  equations  for  the  elements  of  the  gain  matrix  (Ref.  9).  In  addi- 
tion to  being  nonlinear,  the  set  of  equations  is  undetermined,  so  that  no 
unique  solution  exists.  The  undetermined  equations  can  be  augmented  by 
additional  equations , leading  to  special  types  of  controllers , such  as  "minimum- 
gain  modal  controllers"  and  "prescribed-gain  modal  controllers"  (Ref.  9). 

Both  techniques  are  not  feasible  for  large-order  systems. 


Another  approach  to  computing  gains  for  multi-input  controllers  is  referred 
to  as  "dyadic  control"  and  is  due  to  Simon  and  Mitter  (Ref.  8).  The  computa- 
tional difficulties  associated  with  the  nonlinear  equations  are  circumvented 
by  considering  a special  class  of  systems  characterized  by  a control  gain 
matrix  G^  in  the  form  of  an  outer  (dyadic)  product  of  two  vectors,  one  of 
which  (denoted  by  h)  is  chosen  arbitrarily,  but  in  a way  so  as  to  ensure  the 
controllability  of~the  modes,  and  the  other  (a  row  vector)  is  determined  as 
the  feedback  gain  of  a single-input  system  by  using  the  closed-form  solution 
of  Ref.  8.  The  resulting  gain  matrix  G^  has  always  rank  one,  as  its  rows  are 
proportional  to  one  another,  from  which  it  follows  that  the  control  inputs  are 
proportional  to  one  another.  The  required  control  law  can  be  shown  to  have 
the  form 


F(t)  = 


h E 
~3=1 


. T 

kSi(Pk  ^j^Yj 


T . 2Nc 

Yj®ci?k2i 


(88) 
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Hence,  dyadic,  control  does  not  represent  a genuine  multi-input  control,  as  it 
restricts  the  freedom  of  designing  the  components  of  F(t)  as  distinct  func- 
tions of  time.  It  can  be  concluded  from  the  above  that  for  the  case  of  coupled 
controls,  the  pole  allocation  method  is  recommended  only  when  a single  actuator 
is  used  and  a low  number  of  modes  is  controlled. 

In  contrast  to  coupled  controls,  when  the  IMSC  method  is  used  computation 
of  the  modal  controls  for  desired  closed-loop  poles  is  very  easy.  Denoting 
the  closed-loop  poles  by 

Pj-  = “r  - ^'^r’  ^ . . (89) 


and  Inserting  the  modal  forces  f as  given  by  Eqs.  (78)  into  Eqs.  (75),  it  is 
not  difficult  to  verify  that  the  modal  controls  corresponding  to  the  poles 
defined  by  Eqs.  (89)  have  the  form 
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2a  u + (o) 
r r r 


r = 1,2 , . . . 


(90) 


Then,  the  actual  controls  are  obtained  by  inserting  Eqs.  (90)  into  Eq.  (71) 
for  distributed  controls  and  into  Eq.  (82)  for  discrete  controls. 

ii . Linear  optimal  control 

In  the  case  of  coupled  controls,  we  consider  the  performance  index 
0 = / ^ (v^Qv_  + F\F)dt  (91) 

where  Q and  E.  are  weighting  matrices.  Minimization  of  J leads  to  the  optimal 
control  vector 


F(t)  = - R"^B'\(t)v^(t) 


(92) 


where  K(t)  is  a 2N^  x 2N^  symmetric  matrix  satisfying  the  matrix  Riccati 
equation 

K = -KA  - A^KQ  - Kb'r“^b"^K  (93) 

Because  K is^symmetric,  the  number  of  simultaneous  nonlinear  equations  to  be 
solved  is  2N^  + N_.  For  large  N , computational  difficulties  arise. 

Cj  L»  O 

In  the  steady-state  case,  K = 0,  we  obtain  the  algebraic  matrix  Riccati 
equation.  Through  a series  of  linear  transformations,  the  solution  of  the 
steady-state  Riccati  equation  can  be  reduced  to  the  elgensolutlon  of  a real 
general  matrix  (Ref.  17).  Even  though  stable  elgensolutlon  algorithms  for 
general  matrices  exist,  such  algorithms  are  likely  to  experience  difficulties 
for  relatively  high-order  matrices.  It  is  estimated  that  the  solution  of  the 
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matrix  Riccati  equation  requires  in  excess  of  600N^  multiplications  for  con- 
vergence. Hence,  the  computational  time  increases  dramatically  with  the 
number  of  controlled  modes . 

In  the  transient  case,  the  nonlinear  Riccati  equation  can  be  transformed 
into  a 4N^  x 4N  linear  matrix  equation,  which  must  be  integrated  on-line. 

For  large  N , this  requires  ample  computer  capacity,  so  that  an  on-line  pro- 
cess may  not  be  possible. 

In  the  case  of  the  IMSC  method,  the  performance  index  can  be  expressed  as 
(Ref.  13) 

OO 

J = E J (94) 


where  are  Independent  modal  performance  indices  of  the  form 

•^r  " ^ ^ = 1.2,...  (95) 

in  which  v is  the  rth  modal  state,  Q is  a 2 x 2 diagonal  matrix  and  R is  a 
scalar.  To  compute  the  control  gains,  one  must  solve  now  only  a set  of  inde- 
pendent 2x2  matrix  Riccati  equations.  If  only  N_  modes  are  controlled  by 
discrete  actuators,  then^the  number  of  operations  to  determine  the  control 
forces  is  of  the  order  Nj^/2,  which  are  required  primarily  to  invert  for 
use  in  Eq.  (83).  Hence,  the  computational  time  required  by  IMSC  is  smaller  by 
a factor  of  1,200  compared  with  coupled  controls.  Of  course,  in  the  case  of 
distributed  controls  by  IMSC,  no  matrix  inversion  is  necessary  and  the  compu- 
tational time  reduces  dramatically.  Note  that  in  practice  one  controls  only  a 
finite  number  of  modes. 

As  a final  item  of  interest,  let  us  compute  the  closed-loop  poles  for 
optimal  control  designed  by  the  IMSC  method.  From  Ref.  13,  we  obtain  the 
modal  controls 

f^  = o)J(l-  7l+R‘bu^  - 0J^[2(-1+ Ji+r“^)+r“^]^/^u^,  r=l,2,...  (96) 

so  that,  comparing  Eqs.  (90)  and  (96)  and  considering  Eq.  (89),  we  conclude 
that  the  closed-loop  poles  are 


p = - -|  0)  [2(-l+  /l+R  ^)+r“^]^^^  + i w [2(1+  /i+R  ^)-R 
^r  2r  V r r — 2r  '»  r r 


r = 1,2, . . . (97) 

From  the  above,  it  is  evident  that  IMSC  offers  a wider  choice  of  computa- 
tional techniques  than  coupled  controls,  including  nonlinear  control.  IMSC 
and  coupled  controls  have  been  compared  on  the  basis  of  computational  effort 
and  control  energy  and  IMSC  was  found  superior  (Ref.  18).  In  addition,  in 
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the  case  of  IMSC  the  control  energy  is  independent  of  the  location  of  the 
actuators  (Ref.  19).  Finally,  the  IMSC  was  demonstrated  to  be  robust  with 
respect  to  changes  in  the  system  parameters  (Ref.  20). 


SUMMARY  AND  CONCLUSIONS  ' 

In  designing  controls  for  structures,  one  is  faced  with  the  dilemma  that 
structures  are  basically  distributed-parameter  systems  and  the  control  theory  is 
concerned  almost  exclusively  with  discrete  systems.  To  overcome  this  impasse, 
the  approach  commonly  used  has  been  to  represent  a distributed  structure  by  a 
discrete  model,  i.e. , to  discretize  the  structure.  But,  in  discretizing  a 
distributed  structure,  one  must  exercise  extreme  care,  as  no  discretized  model 
is  capable  of  representing  a distributed  system  with  complete  accuracy. 

Virtually  all  methods  used  for  the  control  of  structure  represent  one  form 
or  another  of  modal  control,  whereby  one  attempts  to  control  the  motion  of  a 
structure  by  controlling  its  modes.  To  this  end,  it  is  customary  to  express 
the  motion  of  the  structure  in  terms  of  modal  coordinates.  In  the  case  of 
open-loop  control,  the  modal  equations  of  motion  are  independent.  In  the  case 
of  feedback  controls,  the  feedback  forces  depend  in  general  on  the  controlled 
modal  coordinates  and  velocities,  the  modal  equations  of  motion  are  recoupled 
by  the  feedback  controls,  unless  the  feedback  controls  are  designed  so  as  to 
preserve  the  independence  of  the  modal  equations. 

In  implementing  feedback  controls,  it  is  necessary  to  know  the  modal  states 
for  the  modes  to  be  controlled.  These  states  can  be  estimated  from  the  measured 
output  of  the  structure  by  means  of  a Luenberger  observer.  In  general,  however, 
a Luenberger  observer  does  not  take  advantage  of  the  orthogormality  property 
of  the  modes.  If  a sufficiently  large  number  of  sensors  is  used,  the  modal 
states  can  be  estimated  more  directly  by  means  of  so-called  modal  filters.  By 
contrast,  the  modal  filters  do  take  advantage  of  the  orthogomality  of  modes. 

As  a result,  observation  spillover  is  virtually  eliminated. 

In  general,  the  feedback  controls  depend  on  all  the  controlled  states, 
giving  rise  to  coupled  controls.  In  using  coupled  controls,  one  is  limited  to 
linear  controls,  as  nonlinear  controls  are  not  feasible.  One  of  the  major 
problems  in  designing  coupled  feedback  controls  is  the  computation  of  the 
control  gains.  In  the  first  place,  all  computational  algorithms  for  the 
determination  of  the  control  gains  lead  to  finite-dimensional  control  gain 
matrices.  This  implies  the  assumption  that  the  actuators  are  discrete  in 
nature,  which  precludes  distributed  controls.  Secondly  the  computational 
algorithms  encounter  difficulties  for  control  gain  matrices  of  large  dimensions, 
so  that  one  is  limited  in  the  number  of  modes  that  can  be  controlled  and  in  the 
number  of  actuators  that  can  be  used. 

Another  modal  approach  consists  of  designing  controls  independently  for 
each  mode  and  is  known  as  the  independent-modal-space  control  (IMSC)  method. 

One  of  the  advantages  of  IMSC  is  that  it  permits  design  of  distributed  controls, 
which  eliminates  control  spillover  completely.  This  makes  it  ideally  suited  for 
the  control  of  flimsy  structures,  such  as  antenna  membranes,  for  which  discrete 
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actuators  are  likely  to  produce  the  difficulties  mentioned  above. 
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AUTHOR'S  PREFACE 


To  set  the  context  of  this  paper,  one  or  two  prefatory  remarks  may  be 
helpful . 

Last  summer,  at  the  Third  "Blacksburg"  Conference  on  this  subject,  I was 
surprised  to  hear  several  speakers  refer  to  the  "fact"  that  "real"  structures 
have  "an  infinite  number  of  modes."  These  remarks  were  usually  accompanied  by 
the  strong  implication  that  any  (mathematical)  model  of  a structure  that  did  not 
possess  this  essential  characteristic  was  quite  suspect,  and  that  such  models 
would  therefore  be  difficult  for  sophisticated  persons  to  tolerate.  In  fairness 
to  the  structural  analysis  community,  I should  hasten  to  add  that  this  Infinite 
Modes  Assertion  was  made  chiefly  by  speakers  who,  whatever  else  their  achieve- 
ments, were  not  distinguished  as  structural  analysts.  If  pressed  to  guess,  I 
would  suppose  their  backgrounds  to  be  in  controls  and  applied  mathematics. 

In  any  case,  repeated  references  to  the  Infinite  Modes  Assertion  at 
Blacksburg  HC  prompted  my  recollection  of  a similar  occasion  just  six  years  ear- 
lier where,  at  what  some  call  the  Zeroth  Blacksburg  Conference  (organized  by  Prof. 
Peter  Likins  at  UCLA),  the  kickoff  panel  session  was  titled  "Primitive  Methods." 
Not  wishing  to  offend  the  members  of  that  panel.  Prof.  Likins  explained  that  in 
choosing  this  session  title  he  was  not  implying  that  the  panel  members  were  them- 
selves primitive.  Instead,  he  said,  he  was  using  the  word  "primitive"  in  a nar- 
row technical  sense,  to  refer  to  methods  based  on  "first  principles."  In  essence, 
this  meant  the  use  of.  partial  differential  equations. 

In  spite  of  Prof.  Likins' disclaimer , however,  there  remained  the  notion 
that  if  one's  capability  to  analyse  the  dynamics  of  flexible  space  structures  did 
not  extend  beyond  PDE's,  one  was  rather  handicapped.  That  notion  seemed  sensible 
in  1975,  and  it  seems  even  more  sensible  today.  Unfortunately,  this  notion  tends 
in  practice  to  be  inconsistent  with  the  Infinite  Modes  Assertion  (for  reasons  to 
be  reviewed  in  this  paper)  . 

To  return  to  Blacksburg  HE,  I had  the  temerity  during  an  end-of-conference 
panel  session  to  question  not  only  the  importance  of  the  Infinite  Modes  Assertion, 
but  the  Assertion  itself.  I would  like  to  thank  Dr.  G.  Rodriguez  of  JPL,  who  was 
present  on  that  occasion,  for  the  opportunity  to  expand  on  this  theme  at  this 
workshop . 


HOW  MANY  VIBRATION  MODES  DOES  A REAL  STRUCTURE  HAVE? 


A 'vibration  mode' 


refers  to  a motion  that  is  physically  possible  in  the 
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absence  of  any  external  influence,  and  in  which  the  elastic  displacements  ^(ri,t) 
at  position  jr  and  time  t all  move  in  unison:  all  displacements  pass  through  zero 

simultaneously,  and  they  all  attain  their  maxima  simultaneously.  The  concept  of 
a 'vibration  mode'  is,  in  fact,  a mathematical  concept  and  can  be  stated  most 
precisely  and  succinctly  in  mathematical  form:  if  a distribution  of  elastic  dis- 

placements of  the  form 

u(r,t)  = i(r)n(t)  (1) 

is  autonomously  possible,  ^(r)  is  called  the  'mode  shape'  and  n(t)  shov/s  the  time 
dependence  shared  by  the  elastic  displacements  at  all  points  in  the  structure. 

It  is  plain  from  (1)  that  the  idea  of  'mode  shape'  is  a special  case  of  the  more 
general  mathematical  idea  of  'separation  of  variables'. 


Realization  vs.  Idealization 


Much  of  the  following  argument  rests  on  the  important  distinction  between 
a 'real'  (i.e.,  physical)  structure  and  someone's  mathematical  model  of  that  real 
structure.  This  distinction  is,  of  course,  essential  on  a philosophical  level: 
whether  dealing  with  high-energy  particle  physics,  black  holes,  or  flexible  space 
structures,  one  is  wise  to  discriminate  between  a symbolic  representation  of  rea- 
lity and  reality  itself.  However,  one  hardly  needs  to  evoke  the  Scientific 
Method  to  justify  the  distinction  between  the  real  structure  and  its  mathematical 
representation.  First,  there  is  an  almost  unlimited  quantity  of  experimental 
data  on  the  dynamics  of  real  structures;  virtually  none  of  this  data  agrees  ex- 
actly with  'theory'.  Second,  if  one  returns  to  the  fundamental  assumptions  that 
underlie  'theory',  it  is  apparent  that  a large  number  of  idealizations  are  made. 
These  assumptions  and  idealizations  are  normally  reasonable  and  defensible,  but 
collectively  they  do  constitute  a well -documented  case  for  distinguishing  bet- 
ween the  structure  itself  and  its  mathematical  model. 


Take,  for  example,  what  is  arguably  the  simplest  structure  of  all--the 
long,  slender,  uniform,  cantilevered  rod.  This  'structure'  is  shown  in  Fig.  la. 
(Its  cousin,  the  'two-rod  satellite',  accompanies  it  in  Fig.  lb.)  As  is  well 
known,  the  PDE  and  associated  end  conditions  for  the  lateral  displacements  of  the 
rod  are 


EIu""  + pu  = f(x,t) 

u(0,t)  = u'(0,t)  = u"(f,t)  = u'"(£,t)  = 0 


(2) 


(A  table  of  symbols  is  appended.) 


(a)  Long,  Slender,  Uniform 
Cantilevered  Rod 


Fig.  1: 


(b)  Simple  Flexible  Satellite 

The  'Simplest'  Cases 
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Yet  the  following  idealizations  must  be  made  to  arrive  at  the  Euler- 
Bernoulli  equation  (above)  for  this  'structure':  (a)  material  continuum,  (b) 

perfectly  elastic  material,  (c)  stress  proportional  to  strain,  (d)  infinitesi- 
mally small  deflections,  (e)  perfectly  cantilevered  root,  (f)  negligible  rota- 
tional inertia,  (g)  negligible  shear  deflections.  This  list  is  undoubtedly  in- 
complete but  amply  long  enough  already  to  demonstrate  that  properties  of  the  PDE 
(2)  will  not  likely  be  exactly  the  same  as  the  corresponding  properties  of  actual 
long  slender  uniform  cantilevered  rods.  Experimental  evidence  tends  to  support 
this  expectation;  the  model  (2)  is  reasonable  for  many  purposes  if  used  intelli- 
gently, but  (2)  is  not  in  any  sense  an  emot  representation  of  reality. 


The  Infinite  Modes  Assertion 

There  is  no  doubt  that  the  PDE  (2)  has  modes  of  the  form  (1),  and  that  it 
has  an  infinite  number  of  such  modes.  The  question  at  issue  is  whether  real  rods 
also  possess  these  properties.  To  state  that  a real  structure  has  an  infinite 
number  of  modes  is,  on  reflection,  to  state  an  absurdity.  How  can  a structure 
have  more  modes  than  it  has  molecules,  or,  for  that  matter,  than  there  are  mole- 
cules in  the  known  universe?  What  does  a frequency  of  u = 10^°°  Hz  mean?  Does 
it  mean,  among  other  things,  that  particles  in  the  structure  move  faster  than  the 
speed  of  light? 

At  this  point  the  reader  may  retort,  "Wait  a minute.  Let's  not  be  extreme 
When  someone  asserts  that  a structure  has  an  infinite  number  of  modes,  all  he 
really  means  is  that  the  structure  has  a very  large  (but  finite)  number  of  modes." 
Not  so,  in  the  author's  experience.  The  Infinite  Modes  Assertion  is  often  made 
at  technical  meetings  to  an  audience  that  includes  individuals  who  are  familiar 
with  structural  models  that  contain  thousands  of  degrees  of  freedom  (and  there- 
fore thousands  of  modes).  To  make  the  Assertion  to  such  an  audience  clearly 
means  that  thousands  of  modes  is  not  enough  (in  the  Assertor's  opinion);  nothing 
less  than  infinity  will  do. 

Yet  it  is  clear  that  the  Assertion  is  wrong,  on  the  grounds  of  physical 
impossibility. 

"All  right,"  the  reader  may  persist,  "the  Assertion  is  indeed  made  (in  its 
strong  form)  and  it  is  indeed  wrong,  but  It  is,  after  all,  only  a harmless  mis- 
understanding". Again  not  so,  in  the  author's  opinion.  Mil  1 ion-dol lar  R & D 
contract  proposals  on  the  dynamics  and  control  of  large  space  structures  are  cur- 
rently under  technical  adjudication.  If  the  adjudicators  fall  prey  to  a corol- 
lary of  the  Assertion--namely,  that  any  methodology  that  does  not  use  PDE's  is 
faulty--they  will  tend  to  favor  proposals  that  promise  an  infinite  number  of 
modes.  In  most  cases,  this  viewpoint  would  be  unwise  and  unjust. 


How  Many  Modes  Are  There? 

If  a physical  structure  does  not  have  an  infinite  number  of  modes,  how 
many  vibration  modes  does  it  have?  The  most  precise  (but  not  very  helpful)  ans- 
wer is:  "none".  As  an  approximation,  the  mathematical  concept  of  a 'mode'  is 

still  very  useful,  however.  This  is  especially  true  for  the  lower  modes.  On  the 
other  hand,  as  one  goes  higher  and  higher  in  mode  number  (past  the  100th  mode, 
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say,  or  the  1000th)  the  mathematical  idea  of  a 'mode'  tends  to  become  increas- 
ingly inappropriate  until,  somewhere  well  this  side  of  infinity,  it  is  wholly  in- 
appropriate. 

To  emphasize  this  idea,  we  introduce  the  following  definition  in  connec- 
tion with  mode  shapes  as  a set  of  basis  functions: 

Definition:  The  absurd  subspaoe  associated  with  a PDE  idealization 

of  a structure  is  the  subspace  spanned  by  all  but  the 
first  billion  modes. 

All  PDE  structural  models  have  an  absurd  subspace.  This  absurd  subspace  is  a 
flaw  in  these  models  but  not  an  important  one  (unless  glorified  by  the  Assertion). 

It  is  a curious  paradox  that  the  greatest  advantage  of  modal  analysis-- 
the  analyst  can  expand  the  general  motion  of  a complex  structure  approximately  in 
terms  of  a few  important  submotions— is  lost  if  an  infinite  number  of  modes  is 
insisted  upon. 


THE  FINITE  ELEMENT  METHOD 

When  one  analyses  structures  in  general,  one  is  not  bothered  by  the  neces- 
sity of  generating  numerical  information.  For  example,  it  may  suffice  to  say 
that  the  small  deflection  iu(r,st)  is  related  to  the  excitation  f(jl,t)  via  an  ap- 
propriate operator  j<  that  is, 

= £(i^,t)  (3) 

where  a is  the  mass  density.  K is  a symmetric,  3x3,  partial  differential  stiff- 
ness operator.  Assuming  that  rigid  displacements  are  prevented  (as  in  Fig.  2), 

K is  positive  definite.  The  mode  shapes  for  Eq.  (3)  satisfy 

(4) 

and  the  orthonormality  conditions  are 


l^(r)ig(r)dm  = 6^^ 


(5) 


where  dm  = a(r.)dV.  For  a system  that  deserves  to  be  called  a 'structure',  there 
will  be  an  infinite  number  of  eigenfunctions  (mode  shapes).  However,  as  we  have 
seen  above,  the  real  structure  that  Eq.  (3)  represents  does  not  share  this  'in- 
finite-modes' characteristic. 

The  modal  coefficients  of  momentum 
and  angular  momentum  (about  0)  are  defined 
as  follows: 


p = (|)  dm  ; h = 

—a 


r^<P  dm 


(6) 


It  can  be  shown  (Ref.  1)  that  the  modal 


Fiq.  2:  General  Elastic  Structure 
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identities  in  the  first  column  of  Table  1 are  satisfied  by  these  coefficients. 
These  modal  identities  and  results  like  Eqs.  (3),  (4)  and  (5)  for  the  generic 
structure  of  Fig.  2 are  powerful  in  that  they  apply  to  all  structures  that  sat- 
isfy the  general  assumptions  that  underlie  Eq.  (3). 


The  'Mathematical  Solution'  Swindle 

Operations  like  the  integration  /^(  )dm  in  Eq.  (5),  or  the  in  Table  1 
can  be  performed  with  the  stroke  of  a pen.  Engineers  dealing  with  speoifio  space 
structures  require  numerical  data,  not  just  elegant  theoretical  results. 

The  classical  method  for  dealing  with  PDE's  like  Eq.  (3)  is  to  expand  the 
solution  in  terms  of  a series  of  functions  that  are  defined,  named,  examined, 
cataloged,  and  expounded  upon.  Usually  these  functions  are  not  especially  easy 
to  calculate.  Even  worse  is  to  define  the  solution  of  Eg.  (3)  in  terms  of  a dif- 
ficult integral.  This  "solution"  (as  the  mathematicians  call  it)  is  in  practical 
terms  often  just  another  mathematically  equivalent  way  of  stating  the  problem. 

The  Knotkwit  function,  whose  origins  are  traced  in  Appendix  A,  furnishes  an  ex- 
ample of  the  different  meanings  that  may  be  attached  to  the  word  'solution'  by  a 
mathematician  and  an  engineer. 

Even  the  functions  sin,  cos,  sinh,  cosh  that  make  up  the  well-known  solu- 
tion for  the  vibration  modes  of  the  simple  rod  in  Fig.  la  require  some  numerical 
sophistication  to  calculate  efficiently.  For  most  structures  of  practical  in- 
terest, 'closed-form'  solutions  are  not  available  and,  even  if  they  were,  they 
would  not  likely  be  much  help  in  numerical  calculations. 


The  Ritz  Method  Revisited 

Frustrated  by  their  difficulties  in  formulating  PDE's  for  complex  struc- 
tures, and  their  further  difficulties  in  extracting  numerical  information  from 
these  PDE's  once  they  have  them,  structural  analysts  began  to  chop  up  complicated 
structures  (on  paper)  into  small  elements.  Each  of  these  elements  could  be  ana- 
lysed and  numerical  data  of  the  required  accuracy  extracted  relatively  easily. 
Initially  this  approach  rested  for  its  justification  on  physical  understanding, 
but  applied  mathematicians  (e.g..  Ref.  2)  have  since  shown  that,  if  properly 
used,  this  finite  element  method  model  (FEM  model)  is,  in  fact,  an  ingenious  im- 
plementation of  the  much  older  method  of  Ritz.  A FEM  model  therefore  enjoys  the 
same  theoretical  foundations  as  the  Ritz  method.  In  particular,  the  conditions 
for  convergence  are  known.  This  convergence  is  to  the  so-called  'exact'  solu- 
tion, i.e.,  to  the  elusive  solution  of  the  PDE  model  that  has  the  same  modeling 
assumptions  as  the  FEM. 

This  property  of  convergence  is  a highly  desirable  one  and  can  often  be 
used  to  advantage--in  connection  with  the  identities  of  Table  1,  for  example. 

But  in  our  celebration  of  this  convergence  to  the  'exact'  solution  we  should  not 
overlook  the  fact  that  the  'exact'  solution  is  'exact'  only  for  the  PDE  model. 

It  is  not  'exact'  at  all  for  the  actual  structure  because  the  PDE  model  is  not 
exact  for  the  actual  structure. 

This  raises  the  following  question:  How  can  an  'error'  of  (say)  1%  matter. 
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when  the  'error'  is  with  respect  to  an  equation  that  is  itself  only  valid  to 
within  (say)  10%?  Yet  it  is  this  sort  of  error,  no  matter  how  small  (and  it  can 
be  made  as  small  as  desired  by  using  sufficient  finite  elements),  that  seems  to 
be  the  chief  concern  of  the  Infinite  Mode  Assertors,  They  do  not  trust  the  FEM 
model  because  it  fails  to  predict  the  'absurd  subspace'  (see  earlier  definition). 
In  the  author's  opinion,  however,  this  'failure'  is  trivial  and  should,  if  any- 
thing, be  counted  as  a point  in  the  FEM  model's  favor  because  the  absurd  subspace 
doesn't  exist  physically  anyway. 


Unification 

To  this  point  in  the  discussion  the  FEM  model  and  the  PDE  model  have  been 
treated  as  though  they  were  competing  alternatives.  They  are  in  an  important 
sense  the  same  model.  The  FEM  model  should  be  viewed  as  a numerical  treatment 
of  a corresponding  PDE  model.  The  finite  element  method  must  surely  be  one  of 
the  most  spectacular  success  stories  in  the  history  of  engineering  analysis. 

FEM  models  circumvent  the  formulational  and  computational  difficulties  of  their 
PDE  counterpart  models,  while  at  the  same  time  providing  a numerical  approxi- 
mation to  the  latter  that  can  be  made  arbitrarily  accurate.  If  enough  modeling 
elements  are  used,  the  error  due  to  a finite  number  of  coordinates  can  always  be 
restricted  to  an  'absurd  subspace'.  The  strength  of  the  FEM  model  is  that  one 
can  do  numerical  calculations  for  complicated  structures;  the  weakness  of  the 
FEM  model  is  that  it  can  never  be  better  than  the  associated  PDE  model  to  which 
it  converges. 


USES  AND  ABUSES  OF  LONG  SLENDER  RODS 


A long,  slender  uniform  cantilevered  rod  appears  in  Fig.  1 and  its  PDE 
model  is  given  by  Eq.  (2).  The  attraction  of  this  'structure'  is  its  simplicity 
and  this  makes  it  ideal  as  a learning  tool.  It  provides  a simple  example  for 
students  being  introduced  to  structural  dynamics.  For  much  the  same  reasons  it 
is  often  cited  to  help  in  explaining  new  ideas  to  colleagues.  Moreover,  many 
satellites  have  rod-like  appendages;  in  such  cases  the  closed-form  characteris- 
tics of  cantilevered  rods  (summarized  in  Appendix  B)  have  direct  practical 
utility. 

Nevertheless,  because  of  its  seductive  simplicity,  the  slender  rod  struc- 
ture tends  to  be  focused  upon  rather  more  often  than  its  limited  range  of  appli- 
cation would  warrant.  In  fact,  the  Infinite  Modes  Assertion  is  often  a symptom 
of  slender-rod  overemphasis.  If  all  the  structures  in  the  world  were  long  slen- 
der rods,  there  certainly  would  be  no  need  for  the  finite  element  method,  at 
least  not  for  structures.  Slender  rod  enthusiasts  often  seem  to  imply  that  FEM 
models  are  really  only  undignified  'engineering  approximations'.  If  such  an 
enthusiast  also  wishes  to  ignore  the  crucial  distinction  between  a physical 
structure  and  its  PDE  model,  he  has  the  right  mind-set  for  accepting  the  Infinite 
Modes  Assertion. 
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Modal  Convergence 


As  a prelude  to  addressing  the  question  'Which  modes  are  important?'  we 
shall  ourselves  also  use  the  long  slender  rod  as  a convenient  starting  point. 
Then,  in  the  next  section,  a more  realistic  (and  complicated)  structure  will  be 
discussed.  The  notation  and  results  in  Appendix  B will  be  taken  for  granted  here. 

The  modal  identities  of  Table  1 can  be  used  as  indicators  of  the  error  in- 
troduced into  a structural  model  by  modal  truncation  (i.e.,  error  with  respect 
to  the  'exact'  PDE  representation,  which  is,  as  we  have  said  repeatedly,  not  to 
be  trusted  too  far  itself).  The  modal  parameters  p^  and  h„  are  shown  for  the 
first  few  modes  in  Fig.  3.  It  is  evident  that  they  decrease  nonotonically  with 
mode  number  and  that  h^^  decreases  with  a faster  than  p^^.  These  observations  can 
be  made  also  from  Fig.  4,  where  the  model  error  indices 

Cl(N)  = 1-2  I X-V  (7) 

a=l 

e^in)  = 1-12  I (8) 

a=l 


have  been  introduced,  corresponding  respectively  to  the  p,^  and  the  h^.  With  no 
modes,  ei(0)  = £2(0)  = 1.  For  all  the  theoretiaally  infinite  number  of  modes. 
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Fig.  3:  Momentum  Coefficients  for 

Slender  Rod 
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Fiq.  4:  Measures  of  Model  Error 


37 


Also  shown  in  Fici.  4 is  the  third  measure  of  error. 


£3(N)  = 1 


1680 

11 


I ^ 


a=l 


(9) 


(see  last  entry  in  Table  1).  This  error  indicator  takes  both  momentum  coeffi- 
cients and 'frequencies  into  account  and  is  thus  a more  plausible  measure  of  mo- 
del error  than  or  e2.  The  index  03  recognizes  that,  other  things  being  equal 
('other  things'  in  this  case  being  and  h„),  the  low-frequency  modes  are  more 
important  than  the  high-frequency  modes.  If  one  wished  to  have  a maximum  of  1% 
model  error,  for  example,  as  measured  by  only  the  1st  mode  should  be  retain- 
ed and  the  rest  deleted. 


LARGE  DEPLOYABLE  SPACE  REFLECTOR 


Long,  slender,  uniform,  cantilevered  rods  can  be  carried  only  so  far. 

They  are  useful  in  teaching  certain  basic  lessons,  but  some  of  these  lessons  are 
not  true  for  more  general  structures.  Therefore  we  now  consider  a typical  space 
structure  of  current  interest--a  large  deployable  space  reflector.  Shown  in  Fig. 
5 is  the  wrap-rib  antenna  reflector  developed  by  the  Lockheed  Missiles  and  Space 
Corporation  (Ref.  3).  A FEM  model  has  been  developed  for  this  reflector  by  the 
Jet  Propulsion  Laboratory  (Ref.  4)  and  a typical  mode  shape,  taken  from  Ref.  4, 
is  shown  in  Fig.  6. 


Fig.  5:  Lockheed  Wrap-Rib  Reflector  Used  on  ATS  6 
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This  model  has  several  complexities  that  a simple  rod  does  not  have.  The 
first  is  that  a PDE  model  is  very  difficult  and  does  not  seem  to  have  even  been 
attempted.  This  leads  to  the  use  of  a FEM  model.  The  second  complexity  is  three- 
dimensional  ity.  For  example,  the  model  momentum  coefficients  and  the  modal 
angular  momentum  coefficients  h are  no  longer  scalars,  but  are  3x1. 


A Criterion  for  Mode  Selection 


A more  subtle  distinction  between  the  wrap-rib  reflector  and  the  slender 
rod  is  that  simple  modal  trunaation  becomes  generalized  to  a process  of  mode  se- 
lection. A glance  back  at  Fig.  3 shows  that  for  a slender  rod  the  p^  and  h^^ 
decrease  monotonical ly  with  a.  In  other  words,  whether  we  order  the  importance 
of  the  modes  according  to  increasing  frequency,  or  according  to  decreasing  p^,  or 
according  to  decreasing  h(^,  the  order  of  the  modes  is  unchanged.  This  lesson, 
learned  well  for  slender  rods,  must  be  unlearned  for  more  complex  structures. 

The  question  of  which  modes  to  keep  is  not  simply  a question  of  'keeping  the 
first  N'  and  dropping  the  rest.  There  are  several  ideas  available  (Refs.  5,6)for 
mode  selection,  and  the  ones  that  rely  solely  on  the  structural  dynamics  are 

those  that  depend  on  u , p , and  h . 

a —a 

We  can,,  for  example,  take  the  first  three  modal  identities  in  Table  1. 
These  three  matrix  identities  correspond  to  18  (independent)  scalar  identities. 

To  create  a single  scalar  indicator  of  how  well  these  18  identities  are  being 
satisfied,  it  is  observed  that  they  may  be  written  as 


CO 


where  the  definitions 
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h hT 
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_ 

(10) 


(11) 
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have  been  introduced.  Then  the  following  scalar  quantity  is  a measure  of  how 
well  these  identities  are  satisfied  after  the  first  N modes: 


= p[i 


I N 

■^(  I M )M'^] 

o ^ ^ —00  -* 

a=l 


(12) 


where  p[-]  stands  for  the  spectral  norm  of  [•].  Note  that  1^  is  here  the  6x6 
unit  matrix,  while  in  Eq.  (11)  1^  refers  to  the  3x3  unit  matrix.  (In  other 
words,  l_  always  stands  for  a unit  matrix  of  compatible  size.) 

/ X N 

The  reasoning  behind  Eq.  (12)  is  as  follows:  the  sum  is  normalized 

based  on  Eq.  (10)  in  such  a manner  that  symmetry  is  retained.  The  resulting 
matrix  is  compared  to  the  ideal  sum,  1^.  The  cumulative  sum  in  Eq.  (12)  is  non- 
decreasing since  1^  is  positive  semi -definite.  The  matrix  difference  in  Eq.  (12) 
must  be  positive  definite  for  finite  N.  Thus  its  eigenvalues  will  be  six  real 
numbers  between  0 and  1.  The  greatest  of  these  six  numbers  is  defined  to  be  the 
error,  E|^^(N). 
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Fig.  7:  Reduction  of  Model  Error 

by  First  42  Modes  Using 
only  Inertial  Quantities 
in  Error  Measure,  i.e.. 
Using  Eq.  (12) 


The  error  em(N)  is  plotted  in  Fig.  7 for  data  typical  of  a wrap-rib  re- 
flector with  48  rios  and  44.4  m in  diameter.  Even  after  42  modes,  £|v|(42)  = 0.66. 

This  slow  convergence  prompts  the  following  comments. 

(a)  In  the  model  used,  some  of  the  higher-wave-number  modes  have  already  been 
deleted.  However,  it  is  not  expected  that  they  would  contribute  materi- 
ally to  £jy|.  (This  is,  in  fact,  why  they  were  deleted.) 

(b)  Just  because  the  eiv|(N)  vs.  N curve  is  'flat'  does  not  mean  that  intermedi- 
ate modes  are  not  making  a positive  contribution.  This  behavior  just 
means  that  they  are  not  contributing  to  reducing  the  maximum  eigenvalue  of 
the  matrix  in  Eq.  ( 12) . 

(c)  A more  detailed  examination  of  the  six  eigenvalues  of  the  matrix  in  Eq. 

(12)  discloses  that  it  is  the  ""  nii  identity  that  is  slow  to  con- 
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verge.  This  is  in  accordance  with  the  slow  convergence  of  for  the 
slender  rod  in  Fig,  3.  The  so-called  'breathing'  modes  for  the  wrap-rib 
reflector  are  few  and  far  between;  yet  it  is  these  modes  that  must  pro- 
duce convergence  in  the  (3,3)  element  of 


A Better  Criterion  for  Mode  Selection 


Obviously  the  error  criterion  (12)  is  excessively  harsh.  It  is  counter- 
intuitive that  a 42-mode  model  can  have  a 66%  error.  A goodly  part  of  the  pro- 
blem is  that  the  criterion  (12)  does  not  take  the  frequencies  wct  account.  One 
of  the  messages  in  this  paper  is  that  frequency  is  not  the  only  parameter  of  im- 
portance in  modal  selection.  However,  it  would  be  extreme  in  the  opposite  direc- 
tion to  exclude  the  (d^  entirely,  as  Eq,  (12)  does.  We  therefore  consider  instead 
the  last  three  modal  identities  in  Table  1,  These  identities  may  be  combined  in- 
to the  single  6x6  identity. 
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where  the  definitions 
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have  been  used. 


(13) 


(14) 

(15) 


The  modal  identity  (13)  suggests  the  following  model  error  indicator: 


e,(N)  = pti I (16a) 

a=l 

This  indicator  is  patterned  after  Eq.  (12),  and  is  plotted  in  Fig.  8.  According 
to  this  indicator,  if  an  error  of  only  2,5%  were  the  most  that  could  be  tolerated 
in  the  model,  the  first  28  modes  would  have  to  be  kept. 

There  is,  however,  a hidden  premise  in  this  last  procedure,  namely,  the 
premise  that  the  modes  must  be  selected  in  their  natural  order  (i.e.,  by  in- 
creasing frequency).  There  is  no  basis  for  this  premise  or  this  procedure. 

Figure  3 shows  that,  for  a slender  rod,  p^^  and  h,^  decrease  monotonical ly  with  a, 
as  would  p^/<JDa’  Thus,  for  a slender  rod,  all  methods  of  ordering 

modes  produce  the  same  order--the  'natural'  order.  For  more  complex  structures 
this  is  no  longer  true.  The  error  indicator  in  Eq.  (16a)  can  therefore  be  im- 
proved (i.e.,  fewer  modes  required  for  the  same  model  accuracy)  by  taking  the 
modes  in  the  cumulative  sum  in  a different  order.  Thus  we  replace  Eq.  (16a)  by 
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and  p is  defined  by 

a 


= P[ 


— oo  — 


1 


(18) 


(Note,  however,  that  the  spectral  radius  operator  does  not  commute  in  addition; 
that  is 


“n 

+ 1 - I P„ 

a=a^^ 

as  might  be  assumed  at  first  sight.) 

As  can  be  inferred  from  Fig.  9,  p certainly  does  not  decrease  monotoni- 
cally  with  a.  This  would  suggest  that  tBe  re-ordering  of  modes  required  by  Eg. 
(16b)  should  be  beneficial.  The  second  plot  in  Fig.  8 shows  that  this  is  indeed 
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the  case.  In  fact,  only  9 modes  are  now  needed  to  give  as  low  as  2.5%  error--a 
saving  of  19  modes  (and  a reduction  in  system  order  by  38  state  variables)  over 
the  previous  un-re-ordered  scheme.  Evidently  mode  selection  can  be,  for  complex 
structures,  far  superior  to  simple  modal  truncation. 


CONCLUDING  REMARKS 


In  summary,  the  main  points  discussed  in  this  paper  are  the  following: 

(a)  Neither  a PDE  model  nor  any  other  mathematical  model  of  a structure  is 
exact. 

(b)  For  complicated  structures,  PDE  models  are  very  difficult  to  formulate  and 
very  difficult  to  extract  numerical  information  from. 

(c)  Even  when  a PDE  model  does  exist,  the  'solution'  in  terms  of  'known  func- 
tions' may  still  require  considerable  effort  to  extract  numerical  infor- 
mation. 

(d)  Viewed  as  a Ritz  method,  a FEM  model  is  not  in  competition  with  the  cor- 
responding PDE  model;  it  is,  instead,  a very  powerful  numerical  method  for 
solving  the  PDE  model. 

(e)  The  idea  of  a 'mode'  is,  in  essence,  a mathematical  one.  It  is  highly  un- 
likely that  any  real  structure  can  vibrate  exaotly  so  that  all  its  points 
move  in  unison;  in  other  words,  it  is  highly  unlikely  that  any  structure 
has  any  modes.  As  an  approximation,  however,  the  idea  of  a mode  is  an  ex- 
cellent one  for  many  structures,  especially  for  the  'lower  modes'.  The 
agreement  between  experiment  and  theory  for  the  'higher  modes'  tends  to 
become  weaker. 

(f)  In  this  approximate  sense,  most  structures  have  a very  large  number  of 
modes.  It  is  elementary  to  show,  however,  that  no  real  structure  has  an 
infinite  number  of  modes.  The  Infinite  Modes  Assertion  is  false. 

(g)  The  only  utility  of  the  Infinite  Modes  idea  is  within  the  purely  mathema- 
tical domain.  See,  for  example,  the  modal  identities  in  Table  1. 

(h)  The  long,  slender,  uniform  cantilevered  rod  has  a simplicity  that  is  at 
once  helpful  and  dangerous.  It  is  a reasonable  structure  on  which  to  ex- 
plain a new  idea,  or  to  test  a new  idea,  but  the  validation  or  generaliz- 
ation of  the  idea  must  be  carried  out  on  structures  of  more  realistic 
complexity. 

(i)  Many  'error  indices'  can  be  defined  as  guidelines  for  structural  modal 
order  reduction.  Simple  modal  truncation,  although  suggested  by  ex- 
perience with  slender  rods,  is  naive.  The  proper  process  is  mode  selec- 
tion, based  on  an  appropriate  error  criterion. 

(j)  The  error  criterion  in  Eg.  (12)  is  unnecessarily  pessimistic  because  it 
ignores  frequency  information.  It  is  as  naive  as  a 'frequencies-only' 
criterion,  at  the  opposite  extreme. 

(k)  The  error  criterion  in  Eq,  (16)  is  superior  to  Eq.  (12),  especially  if  the 
modes  are  selected  according  to  the  order  specified  by  Eq.  (17).  This  is 
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illustrated  for  a wrap-rib  antenna  reflector  in  Fig.  8. 
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Appendix  A - The  Origin  of  the  -Knotkwit  Function 


Some  years  ago , the  eminent  applied 
mathematician  Professor  Will  Knotkwit 
encountered  in  his  theoretical  study  of 
structures  a certain  PDE  whose  solution 
he  could  not  express  in  closed  form. 

Nor  could  he  express  the  solution  in 
terms  of  known  functions . Eventually 
an  important  idea  occurred  to  Prof. 
Knotkwit:  he  introduced  a new  function 

that  was,  by  definition,  the  solution 
of  his  troublesome  equation.  He  pro- 


ceeded to  write  several  papers  on  the 
interesting  mathematical  properties  of 
the  Knotkwit  function  (as  it  became 
known  shortly  before  his  retirement) . 
Professor  Knotkwit  even  lived  to  see  his 
function  referred  to,  by  one  of  his  for- 
mer graduate  students,  as  a 'known' 
function. 

It  is  not  likely  that  the  Knotkwit 
function  will  ever  be  called  an  'ele- 
mentary' function.  What  is  clear,  how- 
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ever,  is  that  any  solution  to  a struct- 
ural dynamics  problem  that  can  be  writ- 
ten in  terms  of  Knotkwit  functions , or 
even  that  can  be  expressed  as  an  inte- 
gral whose  integrand  involves  Knotkwit 
functions  in  a fairly  simple  manner. 


will  be  called  a 'closed-form'  solution. 

Thus,  ultimately.  Professor  Knot- 
kwit achieved  his  'closed-form'  solu- 
tion in  terms  of  'known'  functions. 


Appendix  B - Long  Slender  Rod  Modes 
The  well-known  solution  to  Eq.  (2)  is 


where 


and 


where 


u(x,t)  = <j>^(x)n^(t) 

a=l 


n + = <l>  (x)f(x,t)dx 

a a a ' ' ' 


(f)  = (p£)"^[(coshA  £ - COSA  £)  - K (sinhA  5 - sinA  f)] 

Ot  Ct  (X  0^  vtr  LX 


2/ 

pm  Z 

X = 

a El 


£ = X ’ 
^ I 


S -s 

a a 

'a  C + C 
a a 


(Bl) 


(B2) 

(B3) 

(B4) 


with  s = sinA  , c = cosA  , S = sinhA  , C = coshA  , 

a a a a a a a a 

The  natural  frequencies  are  calculated  by  numerical  solution  of  the  trans- 
cendental equation 


c C +1  = 0 
a a 


(B5) 


The  mode  shapes  of  Eq.  (B3)  can  readily  be  shown  (directly  from  the  differential 
equation)  to  satisfy  the  orthogonality  conditions 

J "0  (a  f 3)  (B6) 


It  is  more  onerous  to  show  that  Eq.  (B3)  satisfies  the  normality  condition 

•^2  2 
(f)  dm  = pi  (|>  dc  = 1 


(B7) 


This  latter  fact  is  often  omitted  from  textbook  discussions. 
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In  free  vibration,  the  force  and  torque  on  the  rod  at  0 are  (see  Fig.  la): 


F(t)  = 


P n 

I a a 


a=l 


(B8) 


where  and  h(^  are  the  coefficients  given  (in  general)  by  Eq,  (6). 
sent  simple  'structure', 
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2(p£)\  /A 
' ' a a 


For  our  pre- 


(B9) 


h 


a 


f 2 

x4>  dm  = pi 


A 

0 


C<|)^d5  2(pl^)'Vx^ 


(10) 


Therefore  the  modal  identities  of  the  first  column  in  Table  1,  which  assume  the 
special  form  shown  in  the  second  column  for  a slender  rod,  imply  the  identities 
shown  in  the  third  column  in  Table  1.  Note  that  the  sums  involve  an  infinitude 
of  transcendental  numbers. 


Appendix  C - Table  of  Symbols 


Roman 

Greek 

£ 

first  moment  of  inertia,  /rdm 

a 

modal  index 

El 

flexural  rigidity  of  a long 

1 if  a = 3;  otherwise  0 

slender  rod 

modal  coordinate  associated  with 

f(x,t) 

force  per  unit  length,  at  posi- 

a 

mode  a 

tion  X,  at  time  t 

K 

see  Eq.  (B4)  in  Appendix  B 

F(r,0 

deflection  at  position  _r,  due  to 
unit  force  at  position  £ 

ot 

X 

a 

see  Eq.  (B4)  in  Appendix  B 

h 

modal  angular  momentum  coeffi- 

5 

x/l  for  slender  rod 

—a 

cient;  see  Eq.  (6) 

i 

dummy  position  vector 

j 

(second)  moment-of-inertia  matrix 

P 

mass  per  unit  length  for  slender 

rod 

stiffness  operator 
rod  length 

a 

mass  density  function 

i 

m 

mass 

mode  shape  for  mode  a 

N 

number  of  modes  retained 

0) 

a 

natural  frequency  for  mode  a 

modal  momentum  coefficient;  see 

Special 

Symbols 

Eq.  (6) 

r 

position  vector 

p[  ] 

spectral  radius 

t 

time 

1 

unit  matrix  (of  appropr.  size) 

£ 

small  elastic  displacement 

( )' 

spatial  derivative 

X 

distance  along  slender  rod 

(*) 

temporal  derivative 
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TABLE  I : SUMMARY  OF  MODAL  IDENTITIES 
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A SYSTEMATIC  OVERVIEW  OF  THE  PROBLEM 
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ABSTRACT 

In  the  theory  of  flexible  structures  control,  there  are  two  approaches 
that  are  usually  advocated.  The  first  is  based  on  the  theory  of  partial  dif- 
ferential equations  and  the  statement  of  the  problem  is  : "If  one  does  not  use 
a continuum  approach  there  will  always  be  a problem  that  has  been  overlooked 
and  any  design  will  not  work  when  implemented  on  a real  system."  In  contrast 
the  second  approach  is  based  on  the  tacit  assumption  that  "finite  is  be- 
autiful." This  school  contends  that  a finite  model  can  always  be  made  to  ap- 
proximate the  original  system  to  any  desired  level  of  accuracy.  Following  this 
strategem,  the  design  is  based  on  a finite  dimensional  model  whose  order  is 
determined  by  a scientific  method  such  as  mode  truncation,  residue  matching 
methods  (based  on  the  partial  fraction  expansion  or  the  eigenvectors  from  the 
diagonalizing  transformation)  or  using  modal  costs.  These  two  schools  are  not 
irreconci lable. 

The  key  to  developing  a reduced  order  model  that  subsumes  the  objections 
of  the  first  school  while  providing  the  finite  model  required  by  the  second 
school  is  the  realization  that  no  reduced  model  can  be  specified  without  first 
specifying  the  closed  loop  gains.  A high  gain  system  means  a large  control 
bandwidth;  a large  bandwidth  means  that  many  more  flexible  modes  must  be  re- 
tained. On  the  other  hand,  the  controller  bandwidth  is  determined  by  the  de- 
sired performance  and  the  disturbances  that  will  cause  performance  de- 
gradation. It  is  important  to  realize  that  no  control  is  required  if  the  sys- 
tem is  not  being  disturbed.  We  are  confronted  with  a tail  chasing  dog  syn- 
drome. We  can't  design  the  control  until  we  have  a model  and  we  can't  get  a 
model  until  we've  designed  the  control.  This  is  the  major  argument  used  by  the 
spokesmen  of  the  distributed  or  p.d.e.  school. 

The  compromise  position  advocated  here  is  to  design  the  system  using 
finite  methods  and  verify  the  design's  robustness  using  results  from  the  con- 
tinuum school.  Thus  one  should  follow  the  recipe: 

0 Determine  the  expected  disturbances  from  both  external  and 
internal  sources  (including  the  measurement  devices). 

0 Determine  the  control  bandwidth  that  will  satisfy  the  desired 
specifications  (i.e.  reduce  the  disturbance  amplitudes). 

0 Truncate  the  structural  model  to  include  all  frequencies 
up  to  some  multiple  of  the  control  bandwidth. 
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0 Of  the  structural  modes  retained,  additionally  discard 
those  that  have  low  mode  costs  (i.e.  that  are  not 
excited  by  the  disturbances  because  they  are  either 
almost  uncontrollable  or  are  not  observable  through 
the  performance  measure  used  for  evaluating  the  design.) 

0 Carefully  evaluate  the  potential  interactions  of  the 

sensors  and  actuators  at  their  locations  to  insure  that 
unstably  interacting  modes  are  not  discarded  even  though 
they  don't  contribute  to  the  cost. 

0 Design  the  controller,  the  compensation  and  the  feedforward 

control  using  a reduced  state  feedback  control  design  algorithm. 

0 Verify  that  the  bandwidth  of  the  controller  is  that  assumed 

at  the  start  of  the  design  process.  If  not  - redesign  the 

control . 

0 Verify  stability,  robustness  and  performance  of  the  final  design. 

This  paper  reviews  each  of  these  design  steps.  It  is  shown  that  without 
damping  there  is  no  finite  model  that  can  be  used  to  design  the  control  sys- 
tem. The  definitions  of  stably  and  unstably  interacting  modes  are  given,  and 
the  design  of  reduced  order  control  and  compensator  systems  is  described.  The 
algorithm  for  the  reduced  order  control  that  uses  an  explicit  computation  of 
the  gradient  and  Hessian  of  the  performance  index  is  shown. 


INTRODUCTION 


There  are  many  concepts  in  the  control  of  distributed  systems  that  are 
important  for  practical  control  design.  However,  no  matter  what  one  thinks  of 
the  continuum  approach,  at  some  step  in  the  design  process  it  must  be  dis- 
carded to  allow  the  designer  to  use  a finite  controller.  It  is  not  possible  to 
implement  a continuum  design  nor  is  it  possible  to  compute  such  a design  for 
any  practical  structure.  Thus  the  issue  in  the  continuum  vs.  finite  argument 
is  not  whether  or  not  one  must  use  a finite  model,  but  when.  It  is  possible  to 
carry  a linear  design  through  to  its  Riccati  p.d.e.  form  and  then  use  a 
Galerkin  method  or  other  projection  concept  to  develop  a finite  controller 
(see  for  example  Gibson  Ref.  1).  The  alternative  is  to  use  a finite  model  at 
the  start.  This  is  usually  prefered  by  the  control  community  whose  task  it  is 
to  design  controllers  for  flexible  structures  because  of  the  availability  of 
structural  analysis  tools  like  NASTRAN. 

To  minimize  the  number  of  definitions  we  have  to  introduce,  the  notation 
used  here  follows  that  of  Ref. 2.  The  theoretical  results  discussed  here  are 
proved  in  Refs.  2 and  3,  and  the  example  that  is  described  appears  in  an 
INTELSAT  report  (Ref.  4).  The  last  step  of  the  design  process  in  which  the 
spillover  bounds  are  evaluated  from  the  p.d.e.  description  of  the  system  and 
the  robustness  is  verified  is  not  discussed  here.  These  can  be  found  in  the 
papers  by  Balas  (Refs. 5 and  6)  and  Kosut  (Ref. 7). 
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FINITE  DIMENSIONAL  MODELS  FOR  STRUCTURAL  CONTROL 


The  only  reason  that  control  is  required  in  any  system  is  that  the  dis- 
turbances cause  the  system  to  be  perturbed  away  from  some  nominal  value 
(position,  velocity  etc.)  and  this  perturbation  is  large  enough  to  cause  the 
specifications  to  be  exceeded.  Therefore  the  starting  point  for  any  control 
design  discussion  must  be  the  disturbances  and  the  specifications.  The 
specifications  determine  the  performance  index  that  is  used  to  determine  an 
optimal  or  suboptimal  design.  Typical  performance  measures  include  line  of 
sight  motion  of  a satellite,  the  line  of  sight  motion  of  the  optical  axis  of  a 
telescope,  or  the  pointing  direction  of  an  antenna.  These  can  always  be  re- 
lated to  the  small  angle  motions  of  a structure  and  through  those  motions  to 
the  displacements  of  the  structure  from  some  nominal  position.  Therefore  the 
criteria  for  which  disturbances  are  important  is  the  contribution  to  the  per- 
formance index  of  those  disturbances  that  are  expected.  This  is  the  message  of 
the  modal  cost  approach  of  Skelton.  It  is  not  necessary  to  use  modal  costs  to 
determine  the  modes  that  should  be  retained  if  the  design  starts  with  a con- 
tinuum approach.  Therefore,  the  continuum  model  is  used  initially  to  develop  a 
rough  measure  of  the  required  bandwidth  of  the  system  subject  to  the  dis- 
turbances as  they  are  expected. 

If  the  output  of  a sensor  is  denoted  by  yj  then,  from  Fig.l,  the  La- 
place transform  of  the  output  is  given  by 


y,(s)  = H J (s)  y 

yjis;  - s“+24o>kS  + c4 

where: 

H(.(s)  is  the  control  actuator  transfer  function 
is  the  disturbance  model  transfer  function 
Hpj(s)  is  the  measurement  devices  transfer  function 
w(s)  is  the  Laplace  transform  of  the  disturbance 

u(s)  is  the  Laplace  transform  of  the  control  input 

The  remaining  terms  are  defined  in  Fig.  1. 


(1) 


Using  (1),  the  infinite  dimensional  system  has  been  written  as  an 
infinite  series  whose  individual  terms  represent  the  Laplace  transform  of  the 
modes  of  the  system.  This  is  the  resolvent  of  the  semigroup  of  the  system 
written  as  a sum.  Since  the  operators  are  bounded,  this  series  always  con- 
verges and  becomes  the  actual  (transcendental)  resolvent.  Because  it  more 
closely  resembles  the  approximate  transfer  functions  that  result  from  finite 
models,  the  form  (1)  is  used  here  to  develop  the  arguments  that  follow. 

Using  the  form  (1),  it  was  shown  in  Ref.  2 that  the  transfer  function  of 
a finite  approximation  to  (1)  becomes  asymptotically  invariant  as  the  number 
of  modes  increases.  This  result  is  useful  because  it  says  that  if  enough  modes 
are  retained  then  a finite  model  exists  that  will  be  arbitrarily  close  to  the 
infinite  dimensional  system.  The  unfortunate  aspect  of  this  approximation  is 
that  the  modes  are  sinusoidal,  and  the  convergence  of  the  inverse  transform  of 
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TRANSFORMATIONS 
TO  MODAL  COORDINATES 


TRANSFORMATIONS 
TO  MEASUREMENT  COORDINATES 
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b(x)  Is  the  control  Influence 
c (x)  and  e (x)  the  measurement  Influences 

(x)  Is  the  mode  shape 


Figure  1 

Block  Diagram  of  Structural  Dynamics  Control  Problem 


the  series  that  results  from  (1)  is  only  in  I2  so  we  have  a Gibbs  phenomena 
effect.  Thus  the  convergence  is  always  poor  at  step  type  of  discontinuities. 
This  effect  was  noticed  by  Wie  and  Bryson  (Ref.  8).  Another  consequence  of 
this  approximation  is  that  a system  whose  transcendental  transfer  function  is 
minimum  phase  can  result  in  a finite  approximation  that  is  non-minimum  phase. 
This  result  also  is  shown  in  Ref.  8 and  its  consequences  are  also  discussed 
in  Ref. 2 and  Ref.  9. 

Since  a finite  model  can  always  be  selected  that  approximates  the  con- 
tinuum description  to  within  an  arbitrarily  small  error,  the  next  question  is 
how  many  modes  must  be  kept.  As  was  pointed  out  in  the  introduction,  this 
question  can  only  be  answered  based  on  the  bandwidth  of  the  controller.  Thus 
if  the  control  bandwidth  is  f^-,  then  the  highest  mode  frequency  that  must  be 
retained  is  lOn  f^.,  where  the  damping  is  lO"*^.  This  result  is  proved  in 
Ref.  2 and  can  be  loosely  justified  based  on  the  magnitude  of  the  resonance 
peaks  that  appear  in  the  transfer  function  (1).  At  resonance,  the  amplitude  of 
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the  Bode  plot  magnitude  is  given  by  l/(2£^i  — g|)  , where  the  frequency  of  the 

mode  is  and  the  frequency  at  which  the  resonance  occurs  is  . 

Thus  on  a log  magnitude  plot  the  amplitude  in  db  of  the  resonance  is  20 
logio  H(jw),  which  is  20n-6  db  at  the  resonance.  Therefore,  since  only  those 
modes  that  have  an  amplitude  that  exceeds  0 db  can  cause  a loss  of  stability 
or  performance,  the  criteria  for  discarding  high  frequency  modes  depends  on 
the  control  bandwidth  and  the  fact  that  for  any  well  behaved  control  the 
closed  loop  response  "rolls  off"  at  least  20  db/decade  after  f^.  Thus  the 
first  frequency  that  guarantees  that  the  resonance  is  always  below  the  0 db 
line  is  n decades  away  from  f^  or  n/2  decades  away  if  the  system  rolls  off 
at  40  db/decade  (which  usually  happens  for  any  system  with  actuators  and 
sensors  because  of  their  dynamics). 


STABLE  AND  UNSTABLE  MODE  INTERACTIONS 


The  question  of  which  modes  are  to  be  retained  is  not  completely  answered 
by  the  criteria  described  above.  There  are  many  modes  that  should  be  discarded 
because  they  are  not  controllable,  observable,  or  disturbable  (when  we  say 
not,  we  should  strictly  speaking  say  weakly,  because  it  is  almost  never  true 
that  a mode  is  completely  unobservable  or  uncontrollable  in  practice).  However 
there  is  a significant  caveat  that  must  be  heeded  when  discarding  modes.  This 
is  the  rule  that  one  never  throws  away  a mode  that  is  unstably  interacting. 
The  criteria  for  determining  the  stability  of  a mode  is  shown  in  Fig. 2.  The 
important  point  is  that  there  is  a phase  shift  between  the  measurement  and 
control  caused  by  the  relative  positions  of  the  actuators  and  sensors.  This 
phase  shift  is  either  0°  or  180°  depending  on  whether  the  actuator  and  sensor 
are  on  opposite  sides  of  a modes  node.  The  best  way  of  determining  this  is  by 
using  the  pictures  of  the  mode  shapes,  as  is  shown  in  Fig. 2 


B.  Unstably  Interacting 


Figure  2 

Mode  Shapes  for  Stable  and  Unstable  Interactions 
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The  evaluation  of  which  modes  to  discard  based  on  mode  costs  can  be  done 
in  a simple  and  straightforward  graphical  way  by  plotting  the  Laplace  trans- 
form of  the  costs  on  a gain/phase  plot  for  the  disturbances  expected.  This 
plot  gives  a visualization  of  the  levels  of  disturbance  at  any  frequency  and 
relates  both  the  disturbance  spectrum  and  the  dynamics  of  the  structure  to  the 
cost.  An  example  of  such  a plot  is  shown  if  Fig.  3.  This  figure  also  shows  the 
mode  numbers  that  are  causing  significant  motion  and  the  pole  zero  order  as 
one  moves  upward  in  frequency,  Poles  that  occur  before  zeros  near  the  jw  axis 
are  unstably  interacting  and  vice  versa.  Thus,  a plot  such  as  Fig.  3 can  be 
used  to  truncate  additional  modes  from  the  problem  based  on  mode  costs  without 
a complicated  calculation  of  the  costs. 

There  are  several  issues  that  arise  in  the  debate  on  control  of  large 
str'uctures  that  are  resolved  by  the  definition  of  stable  and  unstable  mode 
interactions.  The  main  issue  is  the  question  of  whether  or  not  a design  ap- 
proach leads  to  spillover  instability.  Since  the  modes  that  interact  stably 
can  not  be  driven  unstable  by  any  gain  change,  any  design  result  that  shows  no 
stability  loss  or  good  gain  and  phase  margins  when  additional  modes  are  added 
are  probably  adding  only  stably  interacting  modes.  This  is  particularly  true 
if  the  assumption  is  made  that  the  actuators  and  sensors  are  colocated. 


The  mode  cost  for  this  design  was  L.O.S.  motion,  and  the  disturbance  was  a 
monochromatic  sinusoid  at  the  frequency  shown. 
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The  second  issue  that  is  clarified  by  the  definition  of  stable  and  un- 
stable modes  is  what  happens  when  there  are  geometry  changes.  The  only  way 
that  a controller  can  stabilize  an  unstably  interacting  mode  is  to  introduce 
180°  of  phase  shift  into  the  control  loop  at  the  frequency  of  the  unstably 
interacting  mode.  Thus  if  one  were  to  plot  the  frequency  response  of  an  ob- 
server or  Kalman  filter  from  any  sensor  to  any  actuator,  those  modes  that 
interact  unstably  will  be  notched  (i.e.  there  will  be  a deliberate  180°  phase 
shift  at  the  mode  frequency).  This  has  dire  consequences  when  the  geometry 
changes.  Either  the  mode  frequency  change  or  the  mode  shape  change  can  cause 
the  phase  shift  to  be  introduced  at  the  wrong  frequency  and  the  resulting  de- 
sign will  be  unstable  since  the  180°  phase  shift  introduced  to  stabilize  the 
original  design  is  now  at  the  wrong  frequency  in  the  geometrically  different 
structure  (because  of  the  phase  shift  introduced  by  the  controller  in  its 
attempt  to  stabilize  the  interaction  we  now  have  a control  with  gain  >1  at  a 
phase  of  180°,  which  is  unstable).  The  message  is  clear:  if  there  are 
geometry  changes  and  there  are  unstably  interacting  modes,  some  way  must  be 
developed  to  track  the  mode  frequencies,  i.e,  adapt,  or  else  the  design  will 
not  work. 


REDUCED  STATE  FEEDBACK  CONTROL  FOR  LARGE  STRUCTURES 


In  Ref.  2,  a reduced  state  feedback  control  design  algorithm  developed  by 
Rossi,  Ref.  11  was  briefly  described.  We  have  successfully  used  this  algorithm 
to  design  a control  system  for  a large  structure  using  it  to  develop  the 
feedback  gains  for  a constrained  configuration  that  connects  the  output  of  a 
particular  sensor  to  a particular  actuator.  Simultaneously  the  algorithm 
derives  the  poles  and  zeros  for  compensators  that  are  required  by  making  these 
pole-zero  locations  gains  that  are  also  determined  by  the  algorithm. 

If  the  control  system  to  be  designed  is  structured  as  shown  in  Fig. 4, 
then  the  optimal  reduced  state  feedback  algorithm  is  used  to  determine  the 
feedback  and  feedforward  gains  that  optimize  the  performance  index: 


J=  1 zTq^z  + uTru  dt  (2) 


where; 

z is  the  output  (which  is  not  necessarily  the  sensor) 
u is  the  control 
Q2  is  the  weight  on  the  output 
R is  the  weight  on  the  control 

The  reduced  state  algorithm  gives  a solution  that  depends  on  the  initial  con- 
ditions since  (2)  is  equivalent  to  Xq^Pxq.  The  minimizing  feedback  gain 
is  determined  from  a search.  The  algorithm  for  determining  the  minimum  uses  an 
explicit  calculation  of  the  gradient  and  Hessian  tensors  for  J and  the  search 
is  done  in  four  steps.  The  first  step  is  to  compute  the  gradient  and  Hessian 
matrices  and  then  to  diagonalize  the  Hessian.  Since  the  Hessian  is  symmetric, 
the  diagonal ization  can  be  performed  by  an  orthogal  transformation.  Since  in 
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general  the  Hessian  will  not  be  positive  definite,  the  negative  eigenvalues 
are  arbitrarily  changed  in  sign  to  make  the  step  direction  correspond  to  a 
locally  quadratic  curve  fit.  Thus  if  Hq  represents  the  Hessian  matrix  at  the 
zeroth  iteration,  this  step  consists  of  forming  the  following  matrices: 


Ho=  [Vi  Vg] 


Di  0 

0 -D2 


(3) 


where: 

Di  2 diagonal  matrices 

^l\z  3re  the  elements  of  the  orthogonal  transformation 


H^=  [Vj  V2] 


D,  0 VjT 

0 D2  V2T 


The  only  difference  between  Hq  and  H"^  is  that  the  second  matrix  is  now 
positive  definite. 


Figure  4 

Block  Diagram  of  the  Reduced  State  Feedback  Control  Design 
The  control  gains  shown  are  selected  to  provide  the  lowest  cost  J 
for  the  configuration  shown.  Only  the  gains  shown  are  derived  by  the 
algorithm  giving  both  the  feedback  control  and  the  compensator. 
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The  second  step  in  the  algorithm  is  the  determination  of  the  step 
direction  for  the  search.  This  is  done  by  using  the  Taylor  series  for  the  cost 
as  follows: 

J(Ki)=  J(Ko)  + (Ki-Ko)Tg  + Q(Ko,Ki,H+)  + ...  (5) 

where:  Q is  the  quadratic  term  in  the  artificial  Hessian  H'*' 

K is  the  vector., of  gains  that  are  being  optimized 
G is  the  gradient  (in  this  case  a vector) 


From  (Ei)  the  step  direction  is  given  by  -[H'*’]“^G.  Thus  the  third  part 
of  the  algorithm  is  the  determination  of  this  step  from  the  approximate 
Hessian.  The  fourth  step  before  the  process  is  repeated  again  is  the  de- 
termination of  the  step  magnitude.  This  is  accomplished  using  a one 
dimensional  optimization  so  that  Kj  = KQ+(a)sQ,  where  Sq  is  the  step 
direction  determined  above  and  "a"  is  the  parameter  that  is  to  be  determined 
by  the  one  dimensional  search  so  that  J is  minimized.  Many  different  heuristic 
algorithms  have  been  used  to  determine  "a".  One  of  the  important  aspects  of 
the  algorithm  is  that  its  value  is  never  permitted  to  cause  the  gains  to  re- 
sult in  an  unstable  solution.  This  is  done  by  altering  the  one  dimensional 
search  if  the  step  size  is  too  big.  Stability  is  tested  as  a by  product  of  the 
gradient  and  Hessian  calculations. 

The  most  interesting  aspect  of  the  optimization  algorithm  is  the  fact 
that  the  gradient  and  Hessian  are  developed  from  the  same  equations.  These  are 
Lyaporiov  type  equations  and  are  therefore  solved  using  the  same  algorithm.  The 
gradient  and  Hessian  matrices  satisfy  equations  given  in  Refs.  10  and  11. 

The  overall  algorithm  flow  is  shown  in  Fig.  5.  In  practice  we  find  that 
the  solutions  obtained  from  the  reduced  state  feedback  control  designs  are 
within  one  or  two  percent  of  the  full  state  designs  with  orders  of  magnitude 
fewer  gains.  The  important  features  of  this  design  approach  is  that  it  allows 
one  to  incorporate  the  actuator  dynamics  and  the  sensor  dynamics,  the  noises 
on  both  the  sensor  and  the  disturbances  exciting  the  structure  (even  if  they 
are  correlated)  and  the  specifications  in  terms  of  a pre-specified  model.  If 
the  latter  is  used,  the  model  states  are  included  in  the  dynamic  description 
of  the  system  (in  the  A matrix,  the  B matrix  and  the  measurememnt  matrix)  and 
are  used  to  define  the  errors  that  are  optimized  in  the  performance  index, 
but  the  feedback  gains  from  the  model  states  are  not  used.  This  becomes  an 
implicit  model  following  approach.  The  last  feature  that  is  important  is  the 
explicit  incorporation  of  the  compensator  dynamics.  Fig.  4 shows  the  way  the 
compensator  dynamics  are  included  when  it  is  desired  to  design  a notch  filter 
for  removing  the  influence  of  an  unstably  interacting  mode. 


EXAMPLE 


The  spacecraft  design  shown  in  Fig.  6 is  an  example  of  a communication 
satellite  with  a large  solar  array,  an  antenna  that  deploys,  and  a requirement 
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Figure  5 

Constrained  Optimal  Control  Algorithm  Flow  Chart 


0?46-03QO 


Illustrative  Example: 


Figure  6 

Communication  Satellite  Finite  Element  Model 
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on  the  pointing  of  the  antenna  and  the  relative  location  of  the  antenna  with 
respect  to  its  feed.  The  finite  element  model  shown  has  enough  detail  to  allow 
the  modes  up  to  3 Hz.  to  be  accurately  determined.  Three  Hertz  is  20  times  the 
highest  frequency  contained  in  the  disturbances,  which  for  the  damping  assumed 
in  this  model  is  sufficient  to  guarantee  that  the  higher  modes  are 
unimportant.  This  model  is  the  starting  point  for  the  design.  The  mode  shapes 
are  shown  in  Fig.  7,  and  of  those  the  unstably  interacting  modes  are  indicated 
with  an  asterisk.  These  unstable  interactions  come  about  because  there  are  two 
sets  of  three  actuator  ~ the  first  at  the  central  or  core  mast  which  contains 
the  "rigid  body"  actuators  which  are  reaction  wheels  for  the  three  axes  of 
rotation.  The  second  set  of  actuators  are  at  the  antenna  mast  attachment 
point.  There  is  a two  degree  of  freedom  gimbal  joint  and  a worm  screw  device 
to  deflect  the  boom  that  carries  the  antenna  relative  to  the  mast.  This  last 
control  provides  control  of  the  feed  horn  to  antenna  displacement.  The  sensors 
are  three  position  and  rate  sensors  at  the  central  core  plus  three  position 
and  rate  sensors  at  the  antenna  joint,  and  a displacement  and  rate  sensor  that 
gives  the  antenna  position  relative  to  the  feed.  Thus  there  are  14 
measurments,  and  6 actuators.  After  eliminating  the  modes  with  low  mode  costs 
that  are  stably  interacting,  the  number  that  must  be  retained  in  any  design 
including  the  rigid  body  modes  is  13.  The  disturbance  due  to  the  solar  array 
motion  is  modeled  as  a lightly  damped  second  order  system  excited  by  white 
noise  and  each  actuator  and  sensor  is  modeled  as  a first  order  system  with  an 
associated  white  noise  disturbance  (either  a measurement  or  an  excitation). 
The  combined  state  dimension,  with  all  of  these  dynamics,  becomes  42.  The 
control  gains  were  selected  to  provide  the  smallest  number  of  gains  possible. 
Thus  the  individual  actuators  were  given  inputs  from  only  those  sensors  that 
measure  the  variables  in  the  same  degree  of  freedom  (i.e.  the  pitch  reaction 
wheel  used  gains  only  on  the  pitch  attitude  and  rate,  etc.).  The  resulting 
gains  are  shown  in  Table  I.  The  Table  also  shows  the  full  state  control  cost, 
which  was  line  of  sight  motion  of  the  antenna,  compared  with  the  same  cost 
from  the  reduced  control  design.  As  can  be  seen,  the  costs  are  within  a few 
percent  of  each  other,  even  though  there  are  only  12  gains  in  the  reduced 
state  design  compared  with  598  in  the  full  state  design.  Not  only  that,  but 
the  reduced  state  design  has  essentially  the  same  response  to  the 
disturbances,  the  same  closed  loop  bandwidth,  and  the  same  robustness  as  the 
original  full  state  design. 


Table  I 

Gains  for  the  reduced  state  design 


Rigid  Body  x 
Rotation  y 
Actuators  z 


Sensors  (Pos.  then  Rate) 


Rigid  Body 
Sensors 


not  used 


Antenna 


feed 


-3E6  -2E7 


6E6  -2E7 


■1E5  -2E5 


Antenna  x 
Actuators  y 
z 

Full  state  LOS  error  6xl0"3  rad. 


3E4 


5E4 


7E5 


4E6 


I f j6E2  4E6 

Reduced  state  LOS  error  35x10"'^  rad. 
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CONCLUSIONS 


The  main  point  of  this  paper  is  that  there  exists  a way  of  desicjninq  a 
control  system  using  a finite  model  that  accounts  for  the  distributed  nature 
of  the  control  problem  in  a systematic  way.  This  is  accomplished  by  using  a 
mode  description  of  the  system  and  from  that  description  determining  the 
number  of  modes  that  must  be  retained  based  on  the  performance  and  the  dis- 
turbance descriptions.  The  design  then  uses  a full  order  model  that  includes 
all  of  the  dynamics:  sensor,  actuator,  disturbance  and  flexible  motion.  The 
designs  that  result  are  simple,  robust,  perform  well  and  have  the  added  dis- 
tinction of  being  easy  to  implement. 
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ABSTRACT 

This  paper  is  a brief  introduction  and  modest  survey  to  aid  and  familiarize 
researchers  interested  in  the  use  of  continuum  modeling  procedures  applied  towards 
large  space  structure  technology.  The  use  of  such  structural  models  for  the 
distributed  control  of  large  flexible  lattice  structures  offers  a significant 
advantage  over  a numerical  approach.  Any  references  not  identified  in  the 
bibliography  can  probably  be  obtained  in  individual  articles.  This  survey  was 
completed  in  the  month  of  December  1981. 


INTRODUCTION 


A high  degree  of  repetition  enables  a recurring  bay  or  module  to  fully  identify 
a structural  system.  If  the  module  is  composed  of  a moderate  number  of  structural 
elements,  an  analytical  description  of  the  deformation  response  rather  than  a 
numerical  one  may  be  obtained.  For  example,  domes,  barrel  vaults,  and  booms  are 
typical  repetitive  lattices.  They  exhibit  high  stiffness,  light  weight,  and  allow 
for  architectural  freedom  in  providing  large  span  areas  with  no  Intermediate 
supports. 

A historical  development  of  structural  lattices  is  given  in  reference  [1]. 

Domes  are  given  as  an  example.  In  particular,  it  is  shown  that  gradual  changes 
in  material  strength  and  weight  of  lattice  domes  were  paralleled  by  an  increase 
in  span  areas  and  a decrease  in  rise  to  span  ratios.  The  inevitable  collapses 
which  followed  forced  engineers  to  consider  more  scientific  and  systematic  methods 
of  analysis  and  design.  As  the  number  of  joints  and  members  continued  to  grow,  a 
full  exact  analysis  became  impractical  and  simplifying  assumptions  were  made. 

For  Instance,  determinate  truss  systems  with  pin  joints  were  easier  to  analyze 
than  more  efficient.  Indeterminate  lattices  composed  of  flexural  members  joined 
by  rigid  welds.  Other  designs  may  have  ignored  that  the  collapse  load  of  a full 
structure  due  to  buckling  can  be  less  than  the  load  corresponding  to  the 
maximum  stress  of  an  individual  member  in  the  structure.  Also,  recursive 
procedures  (known  as  relaxation  techniques)  introduced  in  the  mid  thirties  could 
be  carried  out  for  only  a few  cycles  by  hand  because  of  time  and  effort  involved 
for  many  Iterations  required  by  an  exact  solution.  It  is  the  advent  of  the 
computer  and  computer  methods  of  structural  analysis  that  surmounted  many  of  these 
early  problems . 
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Currently,  repetitive  lattices  are  drawing  attention  for  application  in 
space.  References  [2,3]  describe  projects  such  as  large  antennae  and  space 
platforms  for  communication,  satellite  sensory  recognition,  solar  collectors, 
and  future  manned  space  operation  centers.  These  structures  must  be  simple, 
light,  portable,  and  of  a highly  regular  design  to  facilitate  automated  production 
and  assembly.  The  large  sizes  which  are  anticipated  will  also  make  space 
lattices  very  flexible.  It  is  the  flexing  motion  that  will  affect  a satellite's 
orientation  and  performance;  therefore,  it  is  important  to  identify  the  static 
and  dynamic  behavior  of  space  lattices  for  a variety  of  loading  conditions.  The 
large  size  of  the  problem  can  make  full  "direct"  solutions  more  expensive  than 
an  approximate  analytical  approach  which  exploits  the  repetitive  nature  of  the 
lattice.  Analytical  approaches  are  therefore  very  appealing, 


REVIEW  OF  CURRENT  METHODS  OF  ANALYSIS 


Present  techniques  used  to  study  space  lattices  fall  into  three  categories 
[4,5].  These  are  1)  matrix  methods,  2)  discrete  field  techniques,  and 
3)  continuum  modeling.  The  first  consists  of  numerically  solving  a set  of 
algebraic  equations  in  a direct  manner.  The  other  two,  discrete  field  techniques 
and  continuum  modeling,  are  analytical  approaches.  Each  have  attractive  features; 
however,  recent  studies  Indicate  that  the  continuum  modeling  approach  shows  great 
promise. 

Direct  methods  of  structural  analysis  use  matrix  methods  based  on  discrete- 
element  idealization.  Numerical  methods  such  as  the  finite  element  technique  or 
finite  difference  method  belong  to  this  category.  In  addition,  matrix  methods 
may  be  classified  into  displacement  (stiffness)  methods,  force  (or  flexibility) 
methods,  or  a mixture  of  the  two.  An  example  of  displacement  and  force  methods 
is  given  in  reference  [6],  mixed  methods  are  treated  in  reference  [7].  What  all 
these  methods  have  in  common  is  the  numerical  solution  of  a set  of  simultaneous 
algebraic  equations.  Today,  computer  programs  can  quickly  assemble  and  solve  a 
system  of  linear  equations  as  well  as  efficiently  manage  data  within  a limited 
core  space.  However,  the  preoccupation  with  handling  data  can  be  a major  expense 
when  the  system  of  equations  is  too  large  to  be;  stored  in  central  core.  Obviously, 
solving  an  analytical  system  described  by  a fey  equations  is  more  preferrable  to 
solving  a large  system  of  equations  numerically,  especially  if  it  has  to  be  done 
more  than  once. 

Discrete  field  techniques  belong  to  one  class  of  methods  used  to  describe 
lattice  structures  or  a pattern  of  elements  analytically.  The  popularity  for 
this  approach  is  due  to  the  fact  that  the  discrete  nature  of  the  lattice  is 
preserved  in  the  governing  equilibrium  equations.  Furthermore,  exact  closed 
form  solutions  may  be  obtained  for  simple  configurations  that  include  continuous- 
discrete  systems  of  mixed  dimensionality.  In  comparison  with  direct  numerical 
methods,  a discrete  field  analysis  does  not  increase  in  problem  size  as  the 
number  of  basic  modules  in  the  lattice  is  increased.  There  are  two  approaches  to 
modeling  a repetitive  lattice  by  discrete  field  methods.  One  method  models  the 
structural  system  as  a whole  whereas  the  other  describes  the  lattice  by  a typical 
component.  These  procedures  are  referred  to  as  macro  and  micro  methods 
respectively. 


64 


Macro  methods  are  particularly  well  suited  for  problems  where  continuous 
structural  systems  Interact  with  discrete  elements  such  as  ribbed  plates.  The 
model  is  in  the  form  of  summation  equations  for  discrete  systems  or  summation- 
integral  equations  for  mixed  discrete-continuous  systems.  The  limitation  is 
finding  suitable  kernel  functions  that  satisfy  the  system  boundary  conditions. 

In  references  [8-10],  the  method  is  applied  to  the  static  and  stability  analysis 
of  ribbed  plates  and  discrete-core  sandwich  plates.  In  reference  [11],  studies 
are  made  of  a ribbed  hyperbolic  cooling  tower.  By  modeling  the  tower  as  a 
surface  of  revolution,  a two  dimensional  axisymmetric  jiroblem  is  reduced  to  a one 
dimensional  beamlike  problem. 

Micro  methods  apply  primarily  to  lattice  systems  whose  components  are 
regularly  arranged.  The  system  model  consists  of  difference  equations  which 
represent  a recurrence  relation  (expressed  in  terms  of  difference  operators)  for 
a typical  joint.  The  effectiveness  of  this  method  is  demonstrated  in  references 
[12-16]  for  application  in  determining  displacements  and  critical  loads  of  one 
dimensional  planar  trusses.  It  is  interesting  to  note  that  the  author  in  ref. 

[13]  found  it  easier  to  solve  the  analogous  equivalent  beam  problem  by  converting 
finite  difference  equations  into  differential  equations  via  truncated  Taylor 
series  rather  than  seek  exact  closed  form  solutions  for  the  difference  equations 
directly. 

There  are  a number  of  publications  that  deal  with  the  analysis  of  rotationally 
symmetric  shell-type  lattices.  In  ref.  [8]  simple  truss  members  are  used  for  a 
static  analysis  and  static-stability  studies  by  Gutkowski  broaden  the  problem  by 
inclusion  of  flexural-torsional  members  [17-19].  Early  ad  hoc  attempts  by 
Gutkowski  to  achieve  a pure  discrete  field  displacement  method  evolve  into  a 
clear  and  systematic  variational  formulation  that  uses  a discrete-element 
idealization  procedure  in  ref.  [20].  By  taking  advantage  of  the  rotational 
symmetry,  an  exact  one  dimensional  beamlike  description  is  obtained.  The 
procedure  also  includes  a dynamic  analysis.  A similar  vibration  study  is  made 
in  ref.  [21]  in  which  ring  elements  are  used  for  bracing.  Investigation  of  the 
stability  of  a cylindrical  lattice  shell  under  axial  compression  is  made  in  ref. 
[22].  The  intent  of  the  study  was  to  validate  the  range  of  equivalent  continuous 
shell  models.  A noteworthy  point  in  the  study  includes  the  use  of  joints  with 
varying  rigidity  which  are  modeled  as  rotational  springs.  Buckling  studies 
which  include  member  imperfections  are  made  for  shell  and  beamlike  structures 
with  periodic  cross  sections  in  ref.  [23].  A dynamic  stability  problem  for  the 
same  lattices  is  addressed  in  ref.  [24].  With  the  exception  of  ref.  [20],  these 
works  are  confined  to  simply  supported  edge  constraints. 

The  last  group  of  methods  consist  of  approximating  a repetitive  lattice  by 
an  equivalent  continuum.  The  equivalence  is  designed  so  that  the  continuous 
model  exhibits  identical  or  similar  response  as  the  actual  discrete  lattice  for 
equivalent  applied  loads  or  energies.  The  approximation  assumes  that  distances 
between  nodes  are  small  compared  to  the  structure's  overall  dimensions  and 
therefore  may  be  treated  by  differential  operators.  There  are  a number  of 
reasons  for  using  such  an  approach.  One  is  the  familiarity  with  differential 
operators  and  various  class  of  methods  for  solving  differential  equations. 

Secondly,  continuum  theories  of  elasticity  allow  a rational  equivalence  based  on 
qualitative  decisions  that  reduce  the  dimensionality  of  the  mathematical  model 
and  physically  identify  the  nature  of  deformation  (e.g.  warping,  bending,  shear). 
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Another  consideration  are  works  by  Dzienlszewskl  [25,26]  who  shows  that  optimization 
of  lattices  based  on  a continous  model  allows  a straightforward  comparison  of 
stiffnesses  with  lattices  of  different  configurations. 

Two  things  are  noteworthy  of  a continuum  model  as  the  number  of  repeating 
modules  are  Increased:  on  is  that  the  accuracy  of  the  response  improves;  and 
secondly,  continuum  models  (like  discrete  field  techniques)  do  not  increase  in 
problem  size  as  the  number  of  repeating  modules  are  increased.  However,  the  cost 
of  approximating  finite  distances  by  infinitesimal  ones  isn't  free,  therefore  the 
range  of  application  for  the  continuum  model  must  be  well  defined.  Early  continuum 
models  were  unreliable  because  there  was  no  way  of  rationally  assesing  the 
approximation  Introduced.  Basically,  the  methods  range  from  intuitive  approaches 
to  more  recent  ones  having  a rational  basis. 

Intuitive  methods  postulate  the  result  before  the  analysis  is  undertaken. 

By  assuming  the  response  of  the  lattice  is  approximated  by  that  of  classical  plate 
or  shell  model  a priori,  existing  differential  equations  and  constitutive 
relations  can  be  used  for  the  study.  The  approach  may  be  extended  so  that  the 
analogous  grldwork  contributes  to  the  continuum  by  averaging  or  uniformly 
distributing  over  some  area  the  stiffness  properties  belonging  the  families  of 
parallel  members  or  individual  members  within  a repeating  module.  The  stiffnesses 
for  the  overall  effective  continuum,  determined  by  orthogonal  transformation  of 
each  parallel  set  or  individual  members  within  a repeating  module,  are  then  equated 
with  those  of  the  existing  plate  or  shell.  A number  of  works  apply  this  technique 
to  bending  and  membrane  problems  of  grid  and  shell-type  lattices  [22,27-31].  It 
should  be  noted  that  as  the  structure  Increased  in  complexity,  intuitive  methods 
can  be  misleading  and  highly  Impractical.  (See  for  example  ref.  [32]). 

A rational  approach  does  exist.  The  method  basically  replaces  discrete  field 
variables  with  truncated  Taylor  series;  furthermore,  the  local  error  Introduced  by 
truncation  can  be  accounted  for  mathematically.  (See  for  example  refs.  [34,35]). 
There  are  two  approaches:  one  method  converts  governing  difference  equations  into 
differential  equations;  the  second  replaces  strain  and  kinetic  energies  based  on 
discrete  parameters  with  continuous  field  variables  and  their  derivatives  obtained 
from  the  Taylor  series. 

Converting  finite  difference  equations  into  differential  equations  is  studied 
in  refs.  [33-37]  for  single  and  double  layered  grids.  In  addition,  one  work  by 
Bazant  and  Christiansen  show  by  direct  analogy  that  a lattice  with  rigid  joints 
exhibits  couple  stresses  and  antisymmetric  shear  strains  which  is  not  characteristic 
of  classical  elasticity.  This  micropolar  continuum  theory  presented  by  Eringen 
[44]  is  a generalization  of  elasticity  theory  for  which  classical  methods  of 
elasticity  are  a subset.  Converting  difference  equations  into  differential 
equations  works  well  for  simple  problems,  however  it  is  difficult  to  apply  to 
more  complicated  configurations  due  to  the  complex  nature  of  the  resulting 
differential  equations. 

The  energy  equivalence  method  is  a rational  approach  that  transforms 
energies  of  a discrete  lattice  into  a continuum  representation.  This  method  is 
practical  and  simple  because  variational  formulations  allow  the  differential 
equations,  boundary  conditions,  and  constitutive  relations  to  be  determined  for 
complicated  spatial  configurations.  The  effectiveness  of  this  method  is 
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demonstrated  in  a variety  of  works.  References  [38,39]  apply  this  to  beam  and 
platelike  truss  lattices  for  static,  vibration  and  thermal  analyses. 

References  [36,40-42]  consider  beam  and  platelike  lattices  with  rigid  joints 
where  a micropolar  continuum  is  useful.  Lastly,  ref.  [43]  expands  the  range  of 
application  of  continuum  modeling  to  Include  buckling  as  well  as  vibration 
wavelengths  of  deformation  on  the  order  of  the  size  of  the  repeating  module  as 
well  as  large  (global)  wavelengths  of  deformation.  Analysis  of  a repetitive 
beamlike  lattice  which  consists  of  thin-wall  beam  members  with  open  cross  sections 
for  space  application  is  also  explored. 
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Lexington,  MA  02173 

ABSTRACT 

This  paper  reviews  recent  work  wherein  the  design  of  active  structural  con- 
trol is  formulated  as  the  mean-square  optimal  control  of  a linear  mechanical  sys- 
tem with  stochastic  parameters.  In  practice,  a complete  probabilistic  description 
of  model  parameters  can  never  be  provided  by  empirical  determinations,  and  a suit- 
able design  approach  must  accept  very  limited  a priori  data  on  parameter  statis- 
tics. In  consequence,  we  formulate  the  mean-square  optimization  problem  using  a 
complete  probability  assignment  which  is  made  to  be  consistent  with  available  data 
but  maximally  unconstrained  otherwise  through  use  of  a maximum  entropy  principle. 
The  rcimifications  of  this  approach  for  both  robustness  and  large  dimensionality 
are  illustrated  by  consideration  of  the  full-state  feedback  regulation  problem. 

1 . INTRODUCTION 

Most  techniques  so  far  proposed  for  the  design  of  active  structural  control  - 
even  those  which  purport  to  treat  the  problem  of  modelling  error  (see,  for  example. 
Refs.  [1-3])  implicitly  assume,  at  some  stage,  that  all  the  maps  of  the  linear  sys- 
tem model  are  precisely  known.  A nominal  or  base-line  design  (often  quadratically 
optimal)  is  then  predicated  upon  this  deterministic  model.  Unfortunately,  such  a 
design  pertains  to  but  a single  point  in  a suitably  defined  system 'parameter 
space,  while,  due  to  numerous  sources  of  modelling  error  (especially  in  construct- 
ing large  order  structural  models),  the  actual  system  is  almost  surely  to  be  found 
at  some  other  point  in  parameter  space.  This  fact  vitiates  any  claim  to  optimal- 
ity for  numerous  current  design  approaches  and  necessitates  much  ad-hoc  "hedging" 
about  the  nominal  design  in  order  to  recover  basic  system  properties  that  are  lost 
in  consequence  df  parameter  uncertainties.  This  paper  reviews  a recently  devel- 
oped approach  (that  of  minimum  data/maximum  entropy  modelling)  which  directly 
incorporates  the  impact  of  modelling  uncertainty  in  the  design  process  by  use  of 
a stochastic  model  and  seeks  to  achieve  desirable  system  properties  while  accept- 
ing parameter  uncertainties  at  their  a priori  levels. 

To  circumvent  the  deficiencies  of  current  design  schemes,  it  is  first  neces- 
sary to  recognize  that  any  model  (even  a large-order  "verification"  model)  never 
encompasses  the  truth  but,  at  best,  is  a mathematical  statement  of  what  and  how 
much  is  known.  Considered  as  such,  a.^  design  model  must  not  only  specify  nominal 
or  expected  values  of  the  system  parameters  but  must  also  contain  an  admission  of 
prior  Ignorance  regarding  deviations  from  nominal  values.  This  provides  motiva- 
tion for  the  quantification  of  prior  ignorance  by  the  representation  of  the  struc- 
ture as  a linear  system  with  random  multiplicative  parameters. 


* This  work  Is  sponsored  by  the  Department  of  the  Air  Force.  The  U.S.  Government 
assumes  no  responsibility  for  the  information  presented. 
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At  this  point,  it  is  tempting  to  adopt  a minimum  expected  cost  approach 
-i.e.  assume  the  random  parameters  to  be  subject  to  some  given  probability  law 
and  then  design  a controller  which  is  optimal  on  the  average.  Because  of  its 
simplicity  and  relative  familiarity,  we  confine  attention  here  to  mean-square 
optimality,  choosing  the  average  of  a quadratic  functlonil  over  the  parameter 
ensemble  as  the  performance  measure. 

However,  the  problem  is  more  subtle  than  is  suggested  by  the  minimum  expect- 
ed cost  approach.  In  practice,  one  is  never  provided  with  a complete  probability 
assignment  based  upon  empirical  determinations  but  must  rely  upon  a highly  limit- 
ed set  of  available  statistical  data.  Avoidance  of  ad  hoc  assumptions  (and  the 
Invention  of  data  which  does  not  exist)  requires  that  one  induce  a complete  prob- 
ability model  for  the  parameters  which  is  consistent  with  the  data  on  hand  but 
admits  the  greatest  possible  prior  ignorance  with  regard  to  all  other  data.  The 
prescription  for  doing  this  is  quite  straightforward:  first  define  a measure  of 

prior  ignorance,  the  entropy,  and  then  determine  the  probability  assignment  which 
maximizes  entropy  subject  to  the  constraints  imposed  by  available  data. 

Now  this  idea  of  a maximally  unpresumptive  stochastic  design  model  may  be 
carried  still  further,  for  we  may  discern  a minimum  set  of  a priori  statistical 
data  which  is  just  sufficient  to  induce  a well  defined  probability  model,  The 
maximum  entropy  approach  to  modelling  and  controller  design  consists  in  (1)  ack- 
nowledging only  the  minimum  data  set  as  available  (2)  construction  of  the  max- 
imum entropy  model  induced  by  this  data  and  (3)  control  design  through  quadratic 
optimization  under  the  maximum  entropy  statistics.  Note  that  this  approach  ack- 
nowledges only  the  minimum  data  on  parameter  statistics  because  this  lends  to  a 
particularly  tractable  design  model  and  entails  the  greatest  design  conservatism 
in  that  the  model  is  maximally  chaotic. 

The  minimum  data/maximum  entropy  model  accounts  directly  for  a priori  uncer- 
tainties without  resort  to  ad  hoc  measures  and  provides  a mechanism  whereby  the 
least  possible  prior  information  may  be  incorporated  within  the  control  design. 

As  suggested  by  the  specific  results  reviewed  in  the  following  sections,  this 
approach  can  provide  robustness  levels  comparable  with  modelled  levels  of  uncer^ 
talnty  and,  most  importantly,  its  use  has  profound  ramifications  for  the  problem 
of  large  dimensionality  in  connection  with  the  burden  of  design  computations. 

2.  Stochastic  Modelling  and  Mean-Square  Optimization;  General  Features 

In  this  and  the  following  sections,  the  steps,  outlined  above,  for  construc- 
tion of  a maximally  unpresumptive  stochastic  model  are  described  more  specifically. 
Actually,  the  following  considerations  constitute  extension,  originally  given  in 
Ref.  C4ll,  of  various  initial  results.  Refs.,  C5,6H,  pertaining  to  a very  restric- 
ted class  of  parameter  uncertainties.  Although  future  developments  for  the  treat- 
ment of  general  forms  of  uncertainty  may  be  anticipated,  we  shall  confine  attention 
here  to  the  results  of  Ref.  wherein  parameter  deviations  of  a skew-hermit ian 

form  were  treated.  Accordingly,  consider  the  linear  system; 

X = (A  + ia(t)  -F)x  + w , x e t e Et^ , t^H  (1) 
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where  x comprises  the  state  of  the  plant  together  with  that  of  any  linear  dynamic 
compensation  that  might  be  employed.  'K  denotes  the  nominal  or  expected  vhlue  of 
the  plant  dynamics  map,  where 

A + ^ < 0 (2) 

is  assumed.  a(t)  represents  that  portion  of  the  system  dynamics  which  is  subject 
to  modelling  error  and  a priori  uncertainty,  where 

a(t)  = a (t)  e R (3) 


To  admit  a sufficiently  broad  functional  class  for  a(t)  (as  it  is  represented  in 
the  design  model)  we  additionally  assume  that  a(t)  is  a zero-mean  stationary, 
random  process  which  is  separable,  of  bounded  total  power  and  is  such  that  the 
Integral  increments: 

/H 

da(T)  (4) 


possess  moments  of  all  orders  for  all  finite 


The  above  assumptions  regarding  A and  a(t)  permit  a reasonably  tractable 
point  of  departure  and,  moreover  render  (1)  applicable  to  structural  systems 
conjoined  with  a variety  of  controller  forms. 

Further,  w(t)  in  (1)  denotes  white  disturbance  and/or  observation  noise  of 
intensity  V while  F represents  the  effect  of  control  action  and  is,  in  some  degree, 
subject  to  design  choice.  With  these  provisions,  we  should  like  to  incorporate 
parameter  uncertainties  (as  embodied  in  a(t))  in  the  design  formulation  by  use 
of  an  explicitly  statistical  model  combined  with  a measure  of  performance  de- 
fined on  the  parameter  ensemble.  As  noted  in  section  1,  we  adopt,  as  the  per- 
formance measure,  the  average  of  a quadratic  functional 


J 


lim 


R = > 0 


(5) 


where  EC./J  denotes  an  average  over  both  disturbance  and  parameter  ensembles,  the 
time  invariant  matrix  R encompasses  the  desired  penalty  on  the  state  and  control 
Inputs,  and  consideration  is  limited  to  the  steady-state  case  wherein  F is  time 
invariant . 


Here  we  note  that  the  class  of  time  invariant  F which  renders  (5)  finite 
and  (1)  second-mean  stable  is  nonempty. 


Under  the  above  assumptions,  it  is  desired  to  determine  the  unconstrained 
parameters  of  F so  as  to  minimize  J.  This  problem  may  be  simplified  somewhat 
by  a restatement  in  terms  of  the  co-state  matrix.  Straightforward  manipulations 
give: 
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: J = [PV] 

P = pM  E [P]  £ 

o = (A  + la{t)  - F)^  P + P (A  + la(t)  - F)  + R 
However,  largely  by  vitttie  of  the  linearity  of  system  (1),  it  can  be  shown 

that: 

E[(  ^a(t))^  P + P ^a(t)]  = HM  (7) 

where  H [].]  is  a linear  mapping  of  the  class  of  hermitian  matrices  onto  itself. 
Thus,  (6)  may  be  recast  in  the  form: 


J = ^[PV] 

r 

0 = (A  - F)'^  P + P (A  - F)  + H [P]  + R 


(8) 


where,  at  least  in  principle,  all  requisite  ensemble  averages  have  been  carried 
out  and  the  problem  is  posed  solely  in  terms  of  the  expected  cost  matrix,  F. 
However,  (see  the  explicit  expressions  (26)  and  (27)  of  Ref.  CS]]) , the  evaluation 
of  HC.D  demands  a complete  statistical  model  of  a(tj^,  12),  i.e.,  knowledge  of 
the  joint  probability  distributions  of  all  orders  of  all  countable  sets  of  incre- 
ments of  the  form  (4).  But,  such  a complete  probability  assignment  is  never 
actually  available  from  empirical  determinations.  In  practice,  one  is  provided 
(or  strives  to  obtain)  a very  limited  and  incomplete  set  of  statistical  data  on 
the  uncertain  parameters.  Typically,  this  available  data  may  comprise  estimates 
of  the  nominal  or  mean  values  of  system  parameters  together  with  various  simple 
measures  of  statistical  deviation  from  the  nominal  values.  In  connection  with 
available  data,  it  is  well  to  set  forth  certain  ground  rules  at  this  point  by 
recognizing  that  however  little  is  known  regarding  uncertainties  in  system  dy- 
namics some  pattern  in  a priori  knowledge  is  almost  always  induced  by  the  proced- 
ures used  in  the  construction  of  the  system  model.  In  devising  a particular  model 
as  the  embodiment  of  physical  law,  the  modelling  analyst  must  not  only  estimate 
the  nominal  or  expected  values  of  system  characteristics  but  must  also  identify 
those  physical  mechanisms  with  respect  to  which  his  analysis  is  prone  to  signif- 
icant error.  This  last  step  is  clearly  necessary  since,  ultimately,  the  validity 
of  the  model  must  be  checked  over  the  possible  range  of  variation  of  the  uncert- 
ain effects.  Within  the  context  of  linear  systems,  the  poorly  modelled  effects 
may  often  be  represented  by  uncertainties  in  parameters  which  enter  linearly  into 
the  system  dynamics.  In  other  words,  variations  in  the  dynamics  map  from  its 
nominal  value  due  to  errors  in  the  modelling  of  a particular  physical  effect 
assume  the  form  aB,  where  B £ C 2nx2n  ig  ^ known  matrix  and  a is  an  uncertain 
scalar.  In  total,  uncertainty  in  the  dynamics  may  be  represented  by: 

^ " fj 
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where  the  a,  . are  uncertain  scalars. 
kl 

In  addition  to  identifying  the  "weak  spots"  in: his,  system  description,  the 
modelling  analyst  must  ultimately  estimate  bounds  or  statistical  measures  of  the 
magnitudes  of  the  aj^j . Such  estimates  are  usually  performed  with  reference  to 
each  a^j  separately,  i.e.,  in  a statistical  context,  the  a priori  data  (which 
the  analyst  either  presumes  on  the  basis  of  modelling  judgement  or  strives  to 
determine  empirically)  comprises  assigned  numerical  values  of  linear  functionals 
of  p where  p [aj^j]  denotes  the  probability  density  of  aj^j  . For  very  large 

order  systems  this  is  perhaps  the  only  practicable  approach,  since  the  large  num- 
ber of  uncertain  parameters  precludes  any  empirical  determination  of  the  joint 
statistics  of  several  of  the  parameters',  simultaneously. 

Thus,  we  argue  that  in  the  process  of  constructing  a model  and  assessing 
its  accuracy,  the  analyst  necessarily  induces  a set  of  "directions"  (that  is,  the 
set  of  the  deterministic  matrices,  j » in  (9))  which  describes  the  geometric 
articulation  of  modelling  uncertainties  and  a priori  data.  Under  the  stipulation 
that  only  marginal  statistics  of  the  aj^^  separately  are  available,  the  By; 

may  be  said  to  specify  the  "a  priori  Information  pattern"  arising  from 
the  model  and  constitute  an  essential  element  of  available  knowledge.  Notfe  that 
we  retain  for  the  ay(t)  all  the  assumptions  mentioned  above  for  a(t). 

Having  discerned  the  a priori  pattern  of  available  information,  one  might 
ask  if  it  is  possible  to  avoid  ad  hoc  assumptions  regarding  unavailable  parameter 
statistical  data  by  constructing  a full  probability  model  which  is  consistent  with 
the  data  at  hand  but  which  presumes  as  little  as  possible  with  regard  to  unavail- 
able data.  Proceeding  further:  "Is  there  a set  of  statistical  data  which  is 

just  sufficient  to  induce  any  well-defined  maximally  unpresumptive  stochastic 
model?"  These  are  the  basic  modelling  questions  relating  to  the  optimization 
problem  (8),  that  we  address. 


3 . The  Minimum  Data/Maximum  Entropy  Model 

Now  consider  the  problem  of  constructing  a stochastic  model  of  system  (1) 
which  replicates  available  data  while  admitting  the  maximum  degree  of  prior  ignor- 
ance regarding  all  other  data. • The  successful  principle  enunciated  by  Jaynes 
H7,8H  has  Immediate  application  here:  the  desired  probability  assignment  is  the 

one  which,  under  the  constraints  imposed  by  available  data,  can  be  realized  in 
the  maximum  number  of  ways  or,  equivalently,  maximizes  an  appropriate  measure  of 
prior  ignorance  (the  entropy). 

We  may  Introduce  measures  of  information  reposed  in  the  statistics  of  para- 
meter deviations  in  the  following  way.  First,  note  that  under  the  a priori  pat- 
tern of  (9),  only  Increments  of  the  form: 
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enter  the  evaluation  of  H C.H  and  thus  have  a bearing  upon  the  optimization 
problem.  Next,  define  a division 


(kj ) ^ (k j ) 


:(kj) 


N. 


kj 


(k,j)  e r?  D ^ J [r?] 


(10) 


of  the  real  line,  where  ?7  is  a set  of  ordered  pairs  {(k,j);  ksjsn}  and  de- 
note the  set  of  increments  (over  right  semi-closed  intervals): 


kj  o 


t(k,j)) 

2 ’ 


kj^  N,  .-1’ 
kj 


(k,  j) 


"N. 


kj 


(11) 


by  {a  ;J  C^D}.  Finally,  let  P(  { a; J C^j})  denote  the  joint  probability  density 
of  the  increments  (11).  Then,  a measure  of  statistical  uncertainty  associated 
with  the  increments  (11)  is  given  by  the  relative  entropy  of  these  increments: 

HCP({a;J  [7?]})]  k -y*  daP({a;J  [77]})ln  P({a;J  [r?]})  (12) 


where  do  is  the  volume  element  in  the  sample  space  of  (11) . Although  it  is 
possible  to  define  a measure  of  entropy  for  the  stochastic  system  as  a whole, 
the  measures  (12)  defined  directly  on  the  a^.(t^,t2)  will  suffice  for  present 
purposes. 


Of  course  we  wish  to  work  with  distributions  constrained  by  statistical  data. 
Let  P denote  the  set  of  available  data  each  element  of  which  consists  of  the 
assignment  of  a numerical  value  to  a particular  linear  functional  of  P({a:J  [(77II}) 
in  a manner  consistent  with  the  a priori  pattern.  Let  P({a:J  C^Dlp)  denote  the 
conditional  density  given  the  data  set  p . Then  the  entropy  of  {a;  3[yi\}  in  the 
presence  of  this  data  is  simply  hCp  ({a^  HT)]]  }|  p)  j in  accordance  with  definition 
(12). 


With  these  notations  we  can  state  that  the  maximally  unpresumptive  probabil- 
ity assignment,  Pp  ({a;  J Et?!!  }) , under  the  constraints  imposed  by  data  is  de- 
f ined  by : 

H[Pp({a;  J[7?]  } )]  = SUP  H[P({a;J[r7]}|P)];  VJC/)]  (13) 

P 

and  it  is  this  probability  assignment  which  avoids  ad  hoc  restrictions  on  the 
available  data. 

But  further,  we  note  the  functional  dependence  of  hEp*(  a;  J )II  upon 
P . Clearly,  distinct  data  sets  generally  induce  distinct  values  of  maximum 
entropy  and  this  implies  an  ordering  of  prior  data  sets.  Thus,  it  is  natural 
to  enquire  whether  or  not  there  exists  a "minimum  data  set",  P CP  , for  which: 
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(14) 


H[pJ‘(  a:J  M )]  = SUPESUP  CP([a;JE^]  } |D)]] 

V*  V p 

for  all  J E^l  and  for  all  t)  such  that  hQp(  ajjJ'C^H  V)1  is  bounded.  If  such  a t?  * 
consistent  with  the  a priori  information  can  be  found,  then  acknowledgement 
of  only  p ” as  available  data  leads  to  a probability  assignment  which  is  maximum 
entropy  over  all  admissible  selections  of  available  data.  The  corresponding 
stochastic  system  model  thus  Incorporates  the  minimum  prior  information. 


Details  concerning  the  resolution  of  the  above  questions  are  to  be  found  in 
Ref.  E4H.  Here  we  merely  state  the  results.  For  the  system  considered,  the  min- 
imum data  set,,  may  be  said  to  comprise  the  nominal  system  maps,  the  matrices 

defining  the  prior  Information  pattern  and  the  so-called  "uncertainty  relax- 
ation times",  T^j : 


(15) 


These  time  scales  correspond  to  essential  information  on  the  scales  of  parameter 
deviations  from  nominal  values.  From  the  assumed  properties  of  the  aj^j(t]_,  tg) , 
it  can  be  seen  that  the  Tj^^  are  real,  non-negative  and  increase  without  bound 
as  the  limit  of  the  deterministic  dynamics  map  is  approached.  Thus  we  may  say 
that  the  reciprocal  relaxation  times,  l/Tj^j(k,j  = l,...,n),  constitute  fundament- 
al, albeit  unconventional,  measures  of  parameter  uncertainty  consistent  With  the 
a priori  information  pattern. 


The  above  data  set  is  minimum  in  the  following  sense.  If  available  data 
lacks  any  element  of  this  set,  there  exists  no  maximum  entropy  probability  assign- 
ment under  which  all  the  aj^j  (tj^,  12)  possess  finite  even-order  moments  for 
finite  |t2  “ tq  | . Indeed  in  this  circumstance,  the  entropy  maximum  problem  is 
ill-posed  in  that  available  data  imposes  no  bound  upon  HElp'({a:J  E??II})I1  . Thus, 
the  and  Tj^.  must  be  made  available  in  order  to  construct  a full  probability 
model  in  a rational  manner.  On  the  other  hand,  for  any  V D p*  and  for  all 


SUP  HEp(  {a:J  E^]  }|P)]  < SUP  HEP({a:  jE^]}fp*)] 


(16) 


Thus,  a truly  "minimum  information"  model  is  obtained  by  acknowledging  only  the 
minimum  data  set,  P”,  as  available.  Taking  this  step  also  results  in  a design 
model  which  is  emlnantly  tractable  in  optimization  computations.  In  fact,  the 
assignment,  Pp*  , Induced  by  P”  yields  a form  of  Stratonovich  state  dependent 
noise  E9l  for  which  system  (1)  has  the  Ito  differential: 


dx(t)  = (A^  - F)  x(t)dt  + daj^^  (o,  t)B^jX(t)  + dW(t) 


A A A - 1- 

m ~ 


'2  ^ B^ 

k,j  T^.  kj 


(17) 
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where  the  a^-s  (o,t)  are  mutually  independent  Wiener  processes  with  intensities, 
1/Tvj . Furthermore,  under  this  minimum  data/maxiraum  entropy  model,  the  operator 
^ L-II  appearing  in  the  optimization  problem,  (8)  , -'assumes  the  form: 


H^CP] 


in 


CP, 


a. 


b. 


(18) 


where  LA,B]  denotes  the  commutator: 

[A,B]  4 AB  - BA  (19) 

As  noted  in  Ref.  when  the  scale  of  a-^.  (o,t)  is  taken  ho  be  large 

H [fH  represents  the  leading  term  in  the  as3mi]^totic  expansion  of  [FH  under 
the  actual  parameter  statistics  (as  distinguished  form  the  maximum  entropy 
statistics).  In  other  words,  the  minimum  data/maximum  entropy  model  manages  to 
preserve  various  asymptotic  properties  of  the  expected  cost,  F,  for  large  levels 
of  parameter  uncertainty.  Thus,  the  design  consequences  of  very  great  uncertain- 
ties may  be  readily  investigated. 


Combining  the  above  results,  the  mean-square  optimization  problem,  (8), 
under  the  probability  model  (17)  assumes  the  form: 


0 


mp  : J = 

= (A  - F)^P  + P(A  - F)  + X]  -2^  ^ 

h,  j kj 


(20) 


With  this  result,  the  minimum  data,  maximum  entropy  stochastic  model,  (17), 
may  be  directly  applied  to  the  mean-square  optimal  design  of  a variety  of  con- 
troller types  (e.g.,  full  state  feedback  regulation,  dynamic  dompensation  with 
output  feedback,  etc.).  One  such  development  for  structural  systems  is  described 
in  the  following  sections. 


4.  Application  to  Structural  Systems  - The  Mean-Square 
Optimal  Full-State  Feedback  Regulation  Problem 

Now  we  Illustrate  the  application  of  the  above  results  to  structural  control. 
For  simplicity,  consideration  is  limited  to  flexible  mechanical  systems  under- 
going deformations  in  the  linear  range  with  no  rigid  body  degrees  of  freedom  and 
negligible  gyroscopic  effects.  Generalizations  to  include  rigid  body  and  gyro- 
scopic effects  are  straightforward  and  will  not  be  addressed  here. 

To  further  fix  ideas,  we  consider  the  case  in  which  the  mass  distribu- 
tion, internal  damping  and  the  sensor/actuator  dynamics,  locations  and  alignments 
are  precisely  known.  While  only  the  elastic  operator  of  the  structure  is  to  be 
considered  subject  to  a priori  uncertainty,  this  uncertainty  reflects  some  of  the 
most  Important  sources  of  structural  modelling  errors.  The  elastic  operator  em- 
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bodies  detailed  information  on  such  properties  as  material  characteristics, 
geometry,  construction  and  structural  joint  conditions  which,  in  absence  of 
extensive  testing,  are  difficult  to  ascertain  with  precision. 

It  is  further  supposed  that  the  original  distributed  parameter  equations 
for  the  structure  are  resolved  in  the  basis  of  N (N  finite  but  large)  modal 
coordinates  associated  with  a set  of  mode  shapes  for  the  nominal  system  (the 
structure  with  its  nominal  values  of  parameters).  In  this  fixed  modal  basis 
(independent  of  parameter  uncertainties),  we  may  write  the  state-space  equation 
in  the  eigen-basis  of  the  nominal,  open-loop  plant  as  j (see  Ref.  CllH  for  a 
detailed  derivation) : 

X = fix  +ySu  + w ; x e (21) 


where  /Sis  the  control  input  map  and  w is  a white  disturbance  of  intensity  ¥>  0 . 
fi  has  the  form: 


-iU-VQ,  o 

o -Ail-rfil 


(22) 


where  ^ and  0 are  diagonal  matrices  comprising,  respectively,  the  modal  damping 
ratios,  K=l.  ..,  N,  and  the  nominal  or  expected  values,  K=  1,  .,,  N , of  the 
modal  frequencies.  Owing  to  uncertainty  in  the  structursfl  stiffness,  is  sym- 
metric and  positive  definite  but  not  diagonal.  However,  the  expected  value  of  0 
is  fi  so  that  the  off-diagonal  elements  of  12  - Q are  zero-mean  random  perturb- 


ations. 


With  regard  to  the  control  optimization  formulation,  we  explicitly  consider 
here  only  the  very  simplest  controller  form  - i.e.  full  state  feedback  regulation: 

u = - Kx  (23) 

with  the  time- invariant  gain,  k , chosen  to  minimize: 

J = E[x^  <j^x  + u'^  R^u  3 a ^>0,  > 0 (24) 

subject  to  (21) . This  restriction  is  made  purely  for  illustrative  purposes  - a 
companion  paper,  Cl2j,  describes  application  of  the  maximum  entropy  approach  to 
far  less  idealized  controller  forms.' 


Now,  note  that  because  an  ensemble-independent  functional  basis  is  employed, 
fils  deterministic  and  only  f2  is  subject  to  statistical  variation.  Moreover, 
from  the  above  definitions,  it  is  seen  that  fi  — Ji,  where: 


^ = eM  = diag  C • • • > ~ ^N’ 


(25) 


is  a zero-mean  skew-hermit ian  variation  of  the  form  appearing  in  (1).  Thus, 
the  maximum  entropy  modelling  approach  described  in  the  preceding  section  can  be 
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Immediately  applied  here. 


To  illustrate  most  simply  the  qualitative  ramifications  of  the  maximum  entropy 
stochastic  modelling  approach,  we  suppose  here  that  in  the  vector  basis  of  (21) , 
all  the  elements  of  y-y  are  a priori  subject  to  significant  uncertainty.  Thus, 
in  conformity  with  the  notation  of  (9)  we  write: 


B = 


0:k>N,  j<N  ork<N,  j>N 


(26) 


where  e^^  ^ ^^2Nx2N  matrix  whose  elements  are  zero  except  for  the  (k,j)^^ 

which  is  unity. 


With  this  pattern  of  a priori  information,  non-trivial  relaxation  times 
may  be  given  as: 

- bk  ■ ‘27) 

Acknowledging  only  fi  , v , the  B|^,  and  the  Tj^^j  as  available  data,  there 
immediately  results  a maximum  entropy  modei  of  the  form  (17).  Of  more  immediate 
interest,  however,  is  the  counterpart  of  (20)  in  the  present  context.  Defining: 


^m 


A 


M - h 1 


J 


A 


^ 1 

diag  [ ^ ;= — ] 

k j=l  ^kj 


(28) 


and  denoting  the  expected  cost  matrix  by  p , the  optimization  problem  embodied 
in  (21)  - (24)  under  the  minimum  data/maximum  entropy  model  of  (21)  (consistent 
with  the  prior  pattern  (26))  can  be  shown  to  assume  the  form: 


min  : J = trCp  v]  (29) 

K 


with  p ^0  satisfying: 


0 = ( Pm  P +P  ~^K)  + (30) 

where 

1 

D[p]]  = T^  ^\j  P®kj  \+N,j+N  P\+N,j+N]  > 0 

j k ""3 


with  B,  . as  defined  in 
k3 


(26) 


(31) 


-1 


approach  zero  (the  limit  of  a deter- 
(30)  reduces  to  the  familiar  Lyapunov  equation. 


We  pause  to  note  that  when  all  the  Tj^^ 
ministic  plant)  --7. 
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Equations  (29)  - (31)  now  represent  a rather  tractable  variational  problem 
and  we  devote  the  remainder  of  this  paper  to  a review  of  its  properties. 

Denoting  ySR2^  ^ > ()  by  a 2>  main  results  on  optimality  and 

stability  may  be  summarized  as  follows: 

Theorem  1 ^ Consider  (29)  - (31)  with  Jl , JI^,  t5i»  R2  and  02  as  defined 

above.  If  (crq  , JJ  ) is  reconstructible,  then 

k = (32) 

minimizes  the  performance  index  (29),  where  p is  the  unique  positive  definite 
solution  of 

0 =P^P  ^ P ra  ~P^2P 


Moreover,  with  (32),  the  closed-loop  system  under  the  minimum  data/maximum  en- 
tropy statistics  is  second  moment  and  almost  surely  exponentially  stkble  for 
all  > 0.  (For  a discussion  of  this  and  other  stochastic  concepts  of  stab- 

ility sie  Ref . C13H  ) . 

Proof : See  Theorems  1 and  2 of  [ll]. 

Thus,  under  reasonably  mild  restrictions,  the  stochastic  design  approach 
yields  a well  posed  optimization  problem  and  affords  considerable  assurance  of 
closed-loop  stochastic  stability. 

Equation  (33),  termed  the  "stochastic  Riccatl  equation",  is  seen  to  be  the 
central  design  equation  for  mean-square  optimal  regulation  under  the  minimum 
data/inaximum  entropy  model.  Note  that  the  familiar  L Q gain  relation,  (32),  is 
preserved  and,  in  the  deterministic  limit  (Tj^l  — » 0, V k, j ) , (33)  reduces  to  the 
standard  Riccati  equation.  Additional  qualitative  properties  are  noted  in  EsU 
and  P-4E1  , moreover,  Refs.  Cs]  and  ElSl  describe  several  numerical  procedures  for 
for  solution  of  (33) , most  of  them  displaying  guaranteed  linear  or  quadratic  con- 
vergence. Ref.  II15II  gives  illustrative  numerical  results  which  reveal  dramatic 
qualitative  differences  between  (33)  and  the  standard  Riccati  equation  - partic- 
ularly at  modest  to  high  levels  of  parameter  uncertainty.  Such  properties  are 
the  subject  of  the  next  section. 


5.  The  Stochastic  Riccati  Equation  - Asymptotic  Properties  for  Large 
Uncertainties 


Having  determined  that  under  the  conditions  of  Theorem  1,  a unique  positive 
definite  solution  of  (33)  exists  for  all  positive  : k,  j = 1,...,N,  it  is 

natural  to  inquire  what  behavior  "p  attains  for  large'^uncertainties , i.e.,  iror  very 
small  relaxation  times. 
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Defining: 


{M}  4 diag 


(34) 


for  any  square  matrix,  M,  we  first  consider  the  case  wherein  there  is  very 
great  uncertainty  with  regard  to  the  open  loop  frequencies  of  (21)  (T^J  very 
large) , 


Theorem  2 


Assume  the  conditions  of  Theorem  1 and  introduce  the  positive 


scaling  factor,  0 , into  T^j^: 


'^kk  ^\]a  ’ ^ ^ 


(35) 


Then,  with  p the  positive  definite  solution  of  (33): 


A.  lim  p =i  p*}  = 

a 10 


r A 

{P} 


0 

A 

{P} 


(36) 


where  {p}  is  the  positive  definite  solution  to: 


“ - ^kk  £ 'ir  ' - ^kk^  + ^ikk-  ^2kk  ^k 


£=#=k  kZ 


; k = 1, , . . ,N 


(37) 


and  where 


rA 


{d.}  = {a,} 


(38) 


- * 

B.  With  p= Ip  },  u given  by  (32)  is  a rate  feedback  law  under  which 
system  (21)  is  stable  for  all  stable  p.  of  the  form  (22)  such  that  p — p is 
diagonal. 


Proof:  Theorem  3 of  Cllll.  See  also  Theorem  16  of  C5l|. 

Thus,  under  very  large  levels  of  uncertainties  in  the  open-loop  frequencies, 
the  solution  of  the  stochastic  Rlccati  equation  automatically  reduces  to  a rel- 
atively simple  as3rmptotic  form  in  which  the  expected  cost  matrix,  p , is  diagonal 
in  the  eigen-basis  of  the  nominal  system  dynamics  map.  This  asymptotic  solution 
gives  rise  to  a rate  feedback  control  law  which  is  stable  regardless  of  the 
values  of  open-loop  frequencies  and  structural  damping  ratios.  Thus,  inclusion 
of  a class  of  parameter  uncertainties  within  the  stochastic  design  model  results 
in  a mean-square  optimal  design  which  is  robust  in  the  presence  of  the  modelled 
uncertainties.  The  above  theorem  show  that  frequency  uncertainties  may  always 
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be  modelled  with  sufficient  conservatism  that  the  control  approaches  an  inherently 
robust  law  arbitrarily  closely. 

The  asymptotic  properties  of  the  state  covariance  are  entirely  analogous  to 
those  of  p , and  we  may  say  that  the  principal  effect  of  open-loop  frequency  un- 
certainties is  to  suppress  cross-correlation  among  distinct  structural  modes. 
Additional  effects  arise  when  we  consider  the  random  intermodal  coupling  intro- 
duced by  uncertainty  in  the  off-diagonal  elements  of  Q in  (22)  . These  new  char- 
acteristics are  illustrated  by  the  case  in  which  all  the  relaxation  times 
approach  zero: 


Theorem  3 Let  p be  the  positive  definite  solution  of  (33)  under  the  conditions 

of  Theorem  1 and  introduce  the  positive  scaling  factor,  O into  T . : 


^ (0,oo)  , ¥ k,  j 


(39) 


Then: 


A 


A.  lim  p = pi 

<yj.o 


p is  the  positive  scalar: 


jo  - [-A  + ( 02  ) T / 02 


(40) 


(41) 


where 


A 1 N 
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A 1 N 

- A 3^  ^ 
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^ i 
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2kk 


and 


where  {0^}  and  {02>  are  as  defined  in  (38) 


(42) 


“1  H ^ 

B.  With  K = R ^ p in  (23),  the  control  is  a rate  feedback  law  which  is 
stable  for  alli2=  i?T>0  in  (22). 


Proof:  Theiorem  4 of  CllH. 


Thus,  in  addition  to  the  diagonalization  of  p,  large  uncertainties  in  all 
the  elements  of  the  structural  stiffness  tend  to  suppress  differences  among  the 
diagonal  elements  of  p . As  can  be  seen  from  (40)  and  (41) , the  asymptotic 
solution  no  longer  distinguishes  the  separate  roles  of  individual  modes.  Indeed, 
we  obtain  a solution  corresponding  to  undifferentiated  chaos  in  which  the  asymptotic 
mean-square  optimal  control  seeks  to  minimize  a performance  index  characterized  by 
an  averaged  (i.e.,  averaged  over  the  modes  of  the  system)  state  weighing, 
and  input  weighting,  for  a system  possessing  an  average  damping,  A • 
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Part  B shows,  once  again,  that  we  may  always  choose  the  relaxation  times 
sufficiently  small  that  the  control  approaches  an  Inherently  robust  form  as 
closely  as  desired.  In  contrast  to  Theorem  2.B,  however,  we  now  have  the 
assurance  of  stability  in  the  face  of  uncertainties  in  all  elements  of  Q . 
Moreover,  the  asymptotic  form  of  the  control  can  actually  be  implemented.  If 
one  has  rate  sensors  co-located  with  actuators:  then 

y = X (43) 

and  the  asymptotic  control  of  Theorem  3 may  be  written: 

^ -1  H -S'  -1 

u = -J&R2^  x = -^R2  y (44) 

^ -1 

Since  p ^2  > ^ this  is  precisely  the  direct  rate  output  feedback  law  dis- 

cussed by  Balas  El6l]  and  is  known  to  be  stable  in  the  face  of  uncertainties 
in  all  modal  parameters . 


In  summary,  the  above  results  provide  a new  theoretical  justification  for 
the  use  of  rate  feedback  in  structural  control  and  Illustrate  the  general  prin- 
ciple: In  the  presence  of  very  great  parameter  uncertainty,  the  mean  square 

optimal  control  within  a minimum  data/maximum  entropy  stochastic  model  is  a 
control  which  is  inherently  energy  dissipative. 


6.  Incoherence  and  Isotropy:  Treatment  of  High  Order  Systems. 

The  specific  results  of  the  last  section  are  mainly  concerned  with  the  case 
In  which  the  relaxation  times  are  all  uniformly  small.  It  is  more  typically  the 
case,  however,  that  certain  groups  of  modes  are  rather  well  known  while  the 
remaining  modes  are  endowed  with  highly  uncertain  parameters.  To  fix  ideas, 
suppose  that  the  open  loop  modes  of  (21)  are  arranged  in  order  of  increasing 
nominal  modal  frequencies.  Furthermore  to  reflect  a progressive  degradation  in 
the  modelling  accuracy  for  higher  order  modes  assume  that  the  quantities: 

N 

?k  "S  ■ Ck+1  > !k 

3-1 

increase  monotonically  with  k.  Under  these  circumstances,  various  qualitative 
features  in  line  with  Theorem  17  of  Ref.  C5[]  are  to  be  expected  and  will  be 
stated  here  without  proof. 

Considering  a design  model  of  finite  but  arbitrarily  large  dimension  under 
the  above  conditions,  we  have  that  given  €>0  there  exists  an  Nq  (c)  sufficiently 
large  that: 

II  yO-  p||  < € IIP  II  (46) 
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where  p is  the  solution  to  (33)  and 


A 

P A 


P 0 
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0 {0j> 
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P 0 

Cl2 
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p 0 

Ci2 
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rA 


(47) 


Pell’  Pcl2’  Pc22’  £ ^ 


N xN 
c c 


determined  according  to  a N^xN^  stochastic 


with 

Riccati  equation  of  the  form  (33)  pertaining  to  the  first  modes  and  ipjjis  Of 
the  asymptotic  form  (36)  . In  brief,  if  modeled  uncertainty  levels  increase  with 
increasing  modal  order  in  accordance  with  (45),  the  resulting  mean^-square  optimal 
control  for  the  high  order,  very  uncertain  modes  apprdachps  the  asymptotic  rate 


feedbacl',  form  of  Theorem  2,  Furthermore,  if  the 
we  Tfiay  again  assert  (46)  - (47)  but  with: 


kj’ 


k = j are  sufficiently  small. 


{Pj} 


{p;} 


A 


(48) 


In  other  words,  the  control  for  very  high  order  modes  may  approach  the  asymptotic 
form  of  Theorem  3 . 


Under  the  conditions  stipulated  above,  the  state  covariance  matrix,  Q will 
approximately  attain  the  forms  analogous  to  (47)  and  (48) for  e sufficiently 
small.  Modes  or  order  greater  than  (e)  are  said  to  constitute  the  Incoherent 
range*  and  are  approximately  mutually  uncorrelated.  Lower  order  modes  which  re- 
tain significant  cross-correlation  are  termed  "coherent"  modes.  Of  course,  with 
T]^j  sufficiently  small,  modes  may  be  found  such  that  a result  for  Q analagous  to 
(40)  holds,  and  these  modes  comprise  the  isotropic*  subrange.  As  noted  in  Ref. 
[11],  the  values  of  the  relaxation  times  not  only  delineate  these  important 
qualitative  regimes  but  also  determine  the  rapidity  with  which  the  system  ap- 
proaches, from  an  arbitrary  initial  state,  the  special  statistical  states 
associated  with  (47)  and  (48).  From  this  point  of  view  alone,  the  relaxation 
times  may  be  considered  essential  parameter  statistical  information. 


* This  termiiaology , Introduced  in  earlier  work  C4-6]],  arose  from  analogies 
with  wave  propagation  in  random  media. 
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Obviously,  the  above  properties  for  p and  Q have  important  implications  for 
the  treatment  of  high-order  systems.  Supposing  that  the  parameters  associated 
with  only  a relatively  few  modes  may  be  considered  well  known,  a priori  norm 
bounds  (see  Ref.  CbD)  may  be  used  to  determine  an  « N sufficiently  large  that 
the  magnitudes  of  all  elements  in  the  error,  Cp  - "p) , incurred  in  the 

approximation : 

K . R2^3  V (49) 


are  as  small  as  desired.  Thus,  to  within  a given  approximation,  the  solution  of 
(33)  for  N large  may  be  obtained  by  solution  of  a reduced  order  (N^xN^,)  stochastic 
Rlccatl  equation  together  with  solution  of  N-N(,  equations  of  the  form  (37)  o^  eval- 
ation  of  closed-form  expressions  (41),  corresponding  to  the  incoherent  and  isotropic 
ranges,  respectively. 

Since  the  solution  of  (37)  or  (41)  requires  little  or  no  computational  effort, 
the  main  task  is  the  solution  of  the  reduced  order  version  of  (33)  for  the  coherent 
modes.  In  practice,  it  is  likely  that  N will  be  modest  (~10  - 20)  and  thus  the 
computational  burden  will  not  be  excessive  even  for  very  high  order  design  models. 
For  uncertainties  of  a more  restricted  class  than  considered  here,  this  possibility 
has  already  been  demonstrated  by  specific  numerical  results  for  a variety  of  design 
examples . 

7 . Concluding  Remarks 

This  paper  has  reviewed  the  basic  ideas  of  the  minimum  data/maximum  entropy 
modelling  approach  and  displayed  its  application  to  structural  systems  having 
significant  a priori  uncertainty  in  the  stiffness  operator.  To  Illustrate  the 
design  consequences  of  the  resulting  stochastic  model,  the  problem  of  mean-square 
optimal,  full-state  feedback  regulation  was  oonsidered.  Treatment  of  the  less 
idealized  problem  of  fixed-order  dynamic  compensator  design  under  the  maximum 
entropy  approach  is  reserved  for  a companion  paper. 

For  the  stochastic  Riccati  equation  arising  from  the  regulator  problem, 
sufficient  conditions  were  given  for  existence  and  uniqueness  of  solutions. 

From  the  results  of  Section  4 one  has  assurance  of  stochastic  stability  for 
the  closed-loop  system.  Moreover,  by  virtue  of  the  asymptotic  properties  given 
in  Section  5,  uncertainty  levels  may  always  be  modelled  with  sufficient  conser- 
vatism to  secure  stability  over  the  actual  (as  distinct  from  the  minimum  infor- 
mation) parameter  ensemble. 

Finally,  the  special  structure  of  the  stochastic  Riccati  equation  gives  rise 
to  a qualitative  distinction  among  coherent,  incoherent  and  isotropic  modes  and 
consequently  permits  the  use  of  very  high  order  models  in  regulator  design  comp- 
utations. These  features  have  significant  consequences  for  the  problem  of  large 
dimensionality.  In  control  design  for  large-order  linear  systems,  the  "curse  of 
dimensionality"  is  manifested  in  the  great  mass  of  processing  of  fundamental  data 
(of  system  models  presuming  complete  information)  required  for  the  formulation  of 
an  optimal  control  policy.  The  results  described  above  intimate  the  possibility 
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that  by  consistent  use  of  design  models  incorporating  limited  system  infor- 
mation, we  may  so  arrange  matters  that  the  processing  required  for  control 
policy  formulation  may  be  similarly  limited. 
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DISTRIBUTED  SYSTEM  MODELING  OF  A 
LARGE  SPACE  ANTENNA 
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California  Institute  of  Technology 
Pasadena,  CA  91109 

ABSTRACT 

A general  approach  for  distributed  parameter  modeling  of  complex  dynamical 
systems  is  described.  The  method  consists  of  dividing  the  system  in  parts 
which  can  be  modeled  by  simple  partial  differential  equations  and  coupling 
the  equations  thus  obtained  by  applying  Hamilton's  variational  formalism  to  the 
entire  system.  The  modeling  of  a large,  offset-fed,  wrap-rib  antenna  is  pre- 
sented to  illustrate  the  approach.  Although  such  models  are  perhaps  not  as 
precise  as  finite-element  models,  they  can  be  useful  for  initial  physical 
Insight  and  parametric  design. 


INTRODUCTION 

Large  space  structures  are  traditionally  modeled  utilizing  the  finite 
element  method.  This  procedure  results  in  matrix  representations  of  very  high 
dimensions  ranging  from  a few  hundred  to  a .few  tens  of  thousands.  When  used 
for  control  design,  the  high  dimensionality  of  the  model  creates  a gamut  of 
almost  insurmountable  computational  problems  such  as:  the  length  of  computing 

times,  the  high  volume  of  core  memory  required,  the  numerical  truncation  and 
round  off  errors,  and  so  forth.  Furthermore,  onboard  computers  used  in  space 
flights  are  only  capable  of  accommodating  models  with  dimensions  of  the  order 
of  ten  or  less.  To  use  the  finite  element  model,  one  has  therefore  to  resort 
to  order  reduction  and  face  all  the  problems  thus  created. 

Another  undesirable  aspect  of  finite  element  modeling  is  that  the  system 
representations  consist  of  a series  of  numbers  which  provide  no  insight  to  the 
physical  behavior  of  the  system.  Parametric  study  is  thus  made  almost  im- 
possible since,  for  each  new  value  of  each  parameter,  a new  model  of  the  system 
has  to  be  computed. 

A distributed  parameter  modeling,  if  feasible,  would  alleviate  many  of 
these  difficulties.  Indeed,  a partial  differential  equation  representation  has 
a concise  mathematical  formulation,  requires  no  order  reduction,  provides 
physical  understanding  of  the  system,  and  is  very  suitable  for  parametric 
studies.  Moreover,  control  techniques  based  on  partial  differential  equations 
have  been  previously  developed  and  successfully  applied  to  simple  systems. 

In  this  paper,  we  present  a general  approach  for  distributed  parameter 
modeling  of  complex  dynamical  systems.  It  consists  of  first  partitioning  the 
system,  into  a number  of  elements  which  can  each  be  modeled  by  simple  partial 
differential  equations.  Hamilton's  variational  formalism  is  then  applied  to 
the  entire  system.  This  results  in  coupling  the  equations  describing  each 
part  into  a set  of  equations  representing  the  whole  system.  The  following 
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section  of  this  paper  details  the  procedures.  The  application  of  the  method  to 
the  modeling  of  a 55-m,  offset-fed,  wrap-rib  antenna  is  described  in  the  last 
section  to  illustrate  the  approach. 


CONCEPT 

A large  space  structure  can  be  conceptually  partitioned  into  an  assembly 
of  rigid  and  flexible  bodies,  as  schematized  in  Fig.  1.  It  is  indispensible 
that  the  partitioning  be  conceived  as  to  lead  to  flexible  bodies  of  simple 
structure  such  as  strings,  beams,  membranes,  and  so  forth. 


The  dynamical  model  for  the  structure  is  derived  using  the  variational 
approach,  i.e.,  by  writing  that  for  any  virtual  displacement  from  the  system's 
trajectory 

r 2 r 2 


(T-V)dt  + 6 


F.rdt  = 0 


where  T and  V are  the  kinetic  and  potential  energies  respectively  and  F.r 
indicates  the  work  of  the  applied  forces  during  the  displacement. 


The  modeling  task  is  therefore  reduced  to  the  computation  of  the  different 
terms  T,  V,  F.r  and  their  respective  variations. 


Kinetic  Energy 


The  kinetic  energy  is  given  by  the  expression 


T = I ( 


pR.R  dQ) 


where  R is  the  vector  from  the  origin  of  the  inertial  reference  frame  to  an 
arbitrary  point  of  the  system  (cf.  Fig.  1)  and  p the  mass  density  at  this 
point.  indicates  the  volume  of  the  ith  body,  and  (')  the  differentiation  with 
respect  to  time.  The  summation  is  performed  over  all  rigid  and  flexible  bodies. 


Performing  the  indicated  integrations,  one  obtains  an  expression  of 
the  form 


^ x.'^  M.x.  + ^ (y 

FBi 


2 1 11 


FB-i 


x„.  M„.  x„,  dn.) 
2x  2x  2x  X 


The  first  term  in  this  expression  represents  the  sum  of  the  kinetic 
energies  due  to  the  movement  of  the  rigid  bodies  and  to  the  rigid  body  motion 
of  the  flexible  bodies.  The  second  term  characterizes  the  kinetic  energy 
produced  by  the  deformations  of  the  flexible  bodies,  x-j^  is  a vector  of  continuous 
time  functions:  X]^  e CCR’^)  . X2£  are  vectors  of  continuous  functions  of  time 

and  space  on  the  flexible  body  i (FBj^)  : X2j^  e L^(FBj).  and  M2±  are  self- 

adjoint  positive  semidefinite  matrices.  The  physical  properties  of  the 
structure  impose  a set  of  geometric  boundary  conditions  involving  x^  and  X2i • 
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Figure  1.  Conceptual  Representation  of  a Large 
Space  Structure 


Defining  the  state  space 
n 

s = C(r’^)  X n l2(fb^) 

and  the  proper  scalar  product  < ,>  on  this  space  the  kinetic  energy  can  be 
written  as 


• • 


<Mx,  x> 


where 


T T T T. 

X = (x^  x^^  X22  . . . x^^  ) e s 


and  M is  a self-adjoint,  positive  definite,  linear  mapping  from  s to  s . 

Potential  Energy 

The  potential  energy  is  stored  in  the  distortions  of  the  flexible  bodies 
only.  It  is  given  by 


V=  I 


FBi 


•'FBi 


Ki(x2i,  x^.)  dfi. 


where  are  positive  semidefinite  quadratic  forms.  It  is  always  possible 
(Riesz  representation  theorem)  to  reduce  V to  the  form 

V = i <Kx,  x> 
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where  K is  a self-adjoint  positive  semidefinite  linear  mapping  from  s to  s . 

Model 

The  kinetic  and  potential  energies  evaluated,  the  next  step  is  to  apply 
the  variational  principle.  Let  y be  an  arbitrary  admissible  state  of  the 
system  such  that  yCt^)  = y(t2)  = 0,  and  let  Ay,  AeR,  be  a virtual  displacement 
from  the  trajectory  x.  To  the  first  approximation,  for  A small,  the  variation 
of  the  time  integral  of  the  kinetic  energy  is  given  by 


Tdt  = A 


or  integrating  by  part 


Tdt  = - A 


<Mx,  y>  dt 


<Mx,  y>  dt 


The  kinetic  energy  leads  to 


ft. 


Kdt 


<Kx,  y>  dt 


and  the  virtual  work  to 


F.r  dt  = A 


<F,  y>  dt 


Thus,  the  equations  of  motion  are  given  by 


(<Mx,  y>  + <Kx,  y>  - <F,  y>)  dt  = 0 
L 

or,  since  this  equation  must  hold  for  all  arbitrary  times  t^  and  t2  and 
arbitrary  vectors  yes. 


<Mx,  y>  + <Kx,  y>  = <F,  y> 


which  can  be  written  symbolically 
Mx  + Kx  = F 

Note  that  M and  K are  matrices  whose  elements  are  scalars  and  functions 
of  spatial  variables,  as  well  as  operators  with  domain  in  FBj^s , the  flexible 
bodies . 


MODELING  OF  A 55-m  WRAP-RIB  ANTENNA 

The  modeling  of  a 55-m  wrap-rib  offset  antenna  will  be  presented  in  this 
section  to  illustrate  the  approach  described  in  the  previous  section.  The 
antenna,  designed  for  the  Land  Mobile  Satellite  Service  (LMSS)  Project 
[Ref.  1]  is  schematized  in  Fig.  2.  It  consists  of  a massive  spacecraft-feed 
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assembly  and  a 55-m,  deployable,  wrap-rib,  ref lector  dish  connected  by  an  L- 
shaped  frame.  The  longer  arm  of  the  frame,  which  supports  the  spacecraft-feed 
assembly  at  its  free  end,  has  a length  of  80  m and  the  shorter  arm,  which 
supports  the  dish,  a length  of  33.8  m.  In  the  sequel,  the  short  arm  will  be 
referred  to  as  the  upper  boom  and  the  long  arm  as  the  lower  boom. 


The  following  approximations  and  assumptions  are  made  for  the  modeling: 


1)  The  longitudinal  distortions  of  the  booms  are  neglected:  only  lateral  flexures 
and  torsion  about  the  principal  axis  of  the  boom  was  considered. 

2)  The  boom's  masses  are  also  neglected.  Indeed,  they  constitute  less  than  3% 
of  the  total  mass. 

3)  The  dish  is  assumed  planar  and  attached  to  the  upper  boom  through  a rigid 
and  fixed  hub. 

4)  The  spacecraft-feed  assembly  is  approximated  by  a rigid  body  whose  center  of 
gravity  is  placed  at  the  extremity  of  the  lower  boom. 

Kinetic  Energy 

Let  mQ  and  Iq  be  the  mass  and  Inertia  matrix  of  the  spacecraft-feed 
assembly  and  the  reflector-dish  mass.  It  is  easily  established  that, 
neglecting  the  booms’  masses,  the  kinetic  energy  of  the  system  is  given  by 


1 • • 1 • T ,•  1 

i-2»R.e+2*o  Vo + 2 


pp.p  d 0 


where 


"*0  “l 

m = ^ is  the  reduced  mass  of  the  system, 

mo  + mi 


R = the  vector  joining  the  spacecraft  and  dish  mass  centers  (cf . Fig.  3), 


Aq  = the  rotation  vector  of  the  spacecraft, 
= the  domain  of  the  dish. 


p = the  dish  mass  density,  and 


P = the  vector  from  the  dish  mass  center  to  an  arbitrary  element 
of  the  dish  surface. 


The  value  of  the  second  term,  in  the  expression  for  T,  is  directly  given, 
but  the  other  two  terms  have  to  be  evaluated. 

For  the  first  term  we  need  the  vector  R.  Let  us  define  two  orthogonal 
coordinate  systems  as  follows  (cf . Fig.  3): 

(gl,  g2 , g3>  with  the  origin  at  the  elbow  of  the  booms  at  rest;  g3  has 
the  same  direction  and  orientation  as  the  vector  rQ  joining  the  space- 
craft's mass  center  to  the  elbow  at  rest,  g^  orthogonal  to  g3  in  the 
plane  of  the  booms  at  rest  and  oriented  such  as  to  make  the  smallest 
angle  with  rj^ , vector  coinciding  with  the  upper  boom  at  rest  and 
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oriented  towards  the  reflector's  hub;  g2  is  such  as  to  form  a direct 
system  with  and  g3  . 

(S4>  85.  §6^  with  the  origin  at  the  reflector's  center  at  rest;  g^  is 
the  outward  normal  at  the  center  of  the  dish,  85  is  along  and  in  the 
same,  orientation  as  r^ , and  g4  forms  a direct  system  with  g5  and  gg. 
Let  u be  the  displacement  of  the  lower  boom's  tip  expressed  in  the 
(gl , g2  , 83)  coordinate  frame. 

s is  the  displacement  of  the  upper  boom's  tip  expressed  in  the  (g4, 
g5 , gg)  coordinate  frame. 

hg  is  the  vector  from  the  dish  point  of  attachment  to  the  dish  mass 
center  in  the  undistorted  configuration. 

Pq  is  the  translation  of  the  mass  center  of  the  dish  due  to  the  dish 
distortion  only. 

Given  these  definitions  and  the  assumption  that  the  booms  have  no  longi- 
tudinal distortion,  the  vector  R is  given  by  (cf  . Fig.  3) 

R = r^  + r^  + hg  + Pq  + u^g^  + U2g2  + s^g^  + S285 


A simple  differentiation  leads  to 

R = Aq  X R + u^g^  + + s^g4  + >^0  ^^1^3 

+ ;2gg  ^ ^0 


where  is  the  rotation  of  the  tip  of  the  lower  boom  about  g3  and  Y2  the 
rotation  of  the  tip  of  the  upper  boom  about  gg . 

r • • 

It  remains  to  evaluate  the  last  term,  pP.PdO.  Let  Pq  be  the  vector 
from  the  mass  center  of  the  dish  to  some  arbitrary  point  in  the  undistorted 
configuration  which  corresponds  to  P after  deformation,  and  let  y be  the 
vector  joining  this  point  before  deformation  to  its  position  after  distortion 
(cf.  Fig.  4).  To  the  first  approximation. 


Pq  = P 


which  yields 


P - (A„  + Y^g3  + Po  + " - “0 


With  these  values  of  R and  P,  some  simple  but  tedious  algebraic  manipula- 
tions lead  to  the  following  expression: 


1 *T 

T = 2 X Ml 


X -I 


PP.P  dS^ 
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REFLECTOR  DISH  (rHj,  1^) 


Figure  3.  Diagram  of  Antenna  after  Deformation 


Figure  4,  Schematic  Representation  of 
Dish  Distortion 
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The  state  space  vector  x is  given  by 


X = (XqT  S2  Y2  1^/)^ 


With  ’I^Q  defined  as 

ijj  = X 
^0  1 


PP  x(p  - P ) d^^, 


where 


is  the  inertia  matrix  of  the  dish  in  its  undistorted  state,  and  E the  identity 
matrix . 

The  matrix  is  given  by 


'o^'i^'oo 


m R g^ 

m R 

"4 '10’%  94 

mg^r^g,  |mg|g^ 

1 

mRgj 

m 

0 

mgjgs 

0 

m 

T~T  ' T 

™ 93  ^ 93  1 Sg  ^4 

mgjgj 

mgjfjgi 

^ % ^2  ^ 

4‘'4'u»%!"'«3^294 

rag^rjg. 

~ J:T~  “ 


4 


m R 


m gj 
m gj  R^ 
J 


93(4  Hqj) 


4 


mg, 


mgj 


'J  I 1:1  '0  I 

|mgj 

m i;T  „ I J 

mgjhogj  | m g^ 


”4 


mgjR^ 

T - T 
mg*  r' 


mgjgj  mgjg^  mgjrjgj 

IT'  9;  9l  m 9J  9^  m gj  g^ 


^*'4  'o2* 

1 

4 

mgjhggj 

9j  tig  93 

gj(l,.l 

12’  ^6  ^0  ^4 

mgghgg; 

9glll  + I33I 

mR^ 

1 

1 

mgj 

mgj 

m ?2  9^ 

1 ">94 

mgj 

mhjg^ 

4 

1 

1 

0 

0 
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i 

1" 

0 

'1^6 

4'l 


(m  - mj)E 


The  dish  is  formed  by  N (forty-eight  in  our  case)  lenticular  cross- 
sectional  ribs  attached  to  a central,  circular,  rigid,  hub  with  equal  angular 
spacing  0q  (cf . Fig.  5).  A pretensioned  mass  is  stretched  between  the  ribs. 
The  dish  can  thus  be  divided  in  N identical  sectors  numbered  from  zero  to  N-1. 
Due  to  the  na.ture  of  the  material  forming  the  mesh, the  distortions  of  the 
dish  surface  are  due  mainly  to  the  deformations  of  the  ribs  . The  ribs  are 
considered  to  be  cantilevered  at  the  hub.  To  compute  the  kinetic  energy,  the 
mass  of  the  mesh  is  lumped  on  the  ribs.  For  each  sector,  the  kinetic  energy 
is  given  by 


T 

n 


p V .V  dfi  + 


n n 


w .w 
n n 


df^) 


where  v is  the  ln--plane  distortion,  i.e. , the  projection  of  p on  the  plane  of 
the  dish;  w is  the  out-of -plane  distortion , 1. e. , the  projection  of  p on  the 
normal  to  the  dish  plane;  and  L is  the  length  of  the  rib. 
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1 

2 


1 

2 


[Ell,  (- 


^ (r  ,t)  -,(r,t)  ar  (r,t) 

^ ^ ■ ) + GiJ,  ( 


3r 


3r 


1 Sr 


-)  ] dr  + 


S^S  (r ,t)  S^S  (r,t) 

‘hh  <— T + — T— ) + <“ 


sr  (r,t) 

-)^]dr 


Sr 


Sr 


For  each  sector  of  the  dish,  neglecting  the  torsion  of  the  ribs,  the  poten- 
tial energy  is  the  sum  of  the  following  components: 

Bending  energy  of  the  rib; 

In -Plane: 


1 

2 


EI,  ( 


* 2 , 

^ ) dr 

Sr 


Out-of -Plane: 


1 

2 


El  (■ 
o 


S w (r,t)  2 
— n ) dr 


Sr 


Mesh  shear  energy : 
[-L 


1 

2 


^ o ■' 


®0  „ ''^''^>'=>,2 

^ ) rdedr 

o Sr 


Out-of-plane  stiffness  energy  due  to  circumferential  pretension: 
rL  T 


1 

2 


^ w ^ (r , t)  dr 


r0  n 
0 o 


In-plane  stiffness  energy  due  to  mesh  stretching; 
rL  E 


1 

2 


^ V ^(r,t)  dr 


^ r6  n 
0 o 


Equations  of  Motion 


The  expi’essions  obtained  for  the  kinetic  and  potential  energies  lead  to 
the  following  equations  for  the  antenna: 


3 

9 U 

M X = (0 

Sr 


s% 

sr. 

3 

S S, 

9^S, 

2 

1 

1 

2 

2 

3 

^ 9t 

9r 

Sr^ 

Sr^ 

dr 

r=0 

3 

II 

r=0 

r=0 

li 

o 

0 0) 


r=0 


4 4 2 

s u.  s s.  s r. 

1 _ 1 _ 1 

4 4 4 

Sr  Sr  Sr 


i = 1,2 
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Remark : 


The  equations  of  the  in-  and  out-of-plane  motion  of  the  ribs  are  given  in 
a matrix  form.  They  can  be  decoupled  in  a set  of  scalar  equations  by  noticing 
that  both  matrices  and  have  eigenvectors  given  by 


cos 


0 (n-1) 


2tr 

N 


+ sin  0 (n-1)  ^ 

N 


cos  1 (n-1) 


2tt 

N 


+ sin  1 (n-1) 


2ti 


cos  (N-1) (n-1) 


2tt 

N 


sin  (N-1) (n-1) 


27T 


corresponding  to  the  eigenvalues 


and 


In 


2n 


- 4sin^(n-l)  ^ 
4 cos^(n-l)  “ 


n = 1 , . . . ,N 


for  A.|^  and  k^  respectively, 


The  decoupled  equations  can  be  written  as 

„2 


3 P, 


PP  + 
n 


Sx 

2 


7T  El 
2 o 


9x 


„2 

..  g-  9 q 

pq  + ^ El.  ^ 

n .2  1 -s  2 

9x  9x 


T X., 
c In 

“x0 


E 0 . 3q 

c In  , o T,  n 

x0  n 2 rc  2n  9x  9x 


Computation  Results 

In  order  to  establish  the  accuracy  of  the  model  and  its  adequacy  for 
simple,  and  fast  computations,  the  modal  frequencies  of  the  reflector  dish  were 
evaluated  using  a Ritz  approximation  scheme  with  assumed  mode  shapes  of  the 
form 


V /I  r. 

2,  a^  (1  - cos  — ~)  . 

The  following  table  shows  a comparison  of  the  results  to  those  obtained 
through  finite  element  modeling  [Ref.  2]  (using  6624  modes). 
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Table  1.  Comparison  Between  Continuum  and  Finite  Element  Models 


Circumferential  Wave  Number 

PDE 

FE 

0 

1 

0 

1 

1st  out- of -plane 

0.77276 

0.772763 

0.78971 

0.78988 

2nd  out-of-plane 

3.33615 

3.33615 

- 

- 

1st  in -plane 

0.175461 

0.175877 

0.18134 

0.19082 

2nd  in- plane 

0.726048 

0.725721 

0.70185 

0.70423 

The  accuracy  is  quite  satisfactory  as  it  can  be  observed.  As  to  the 
required  computational  effort,  the  difference  with  the  finite  element  method 
is  striking;  the  whole  algorithm  could  be  programmed  on  an  intelligent 
terminal,  and  the  computational  cost  was  about  30  times  less. 

CONCLUSION 

A general  approach  to  partial  differential  equation  modeling  of  large 
space  structures  was  presented  and  its  application  to  the  modeling  of  a large, 
wrap-rib,  offset-fed  antenna  was  described. 

The  partial  differential  model  thus  obtained  has  all  the  expected  traits 
of  simplicity,  conciseness,  and  suitability  for  parametric  studies. 

The  model  was  used  to  evaluate  the  modal  frequencies  of  the  dish.  For  low 
frequencies  the  results  are  very  close  to  those  obtained  from  a finite-element 
model  of  very  high  dimension.  The  striking  feature  is  that  the  required 
computational  effort  was  more  than  an  order  of  magnitude  smaller. 
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MODELING  OF  FLEXIBLE  STRUCTURES 
FOR  ACTIVE  CONTROL* 

Arthur  E.  Bryson,  Jr. 

Stanford  University 
Stanford,  CA  94305 


ABSTRACT 


If  a flexible  structure  has  a plane  of  symmetry,  the  equations  of  motion  can 
be  split  into  two  uncoupled  sets,  one  for  symmetrical  motions  and  one  for  anti- 
symmetric motions. 

If  there  are  m controls,  it  is  often  convenient  to  assign  the  linear  com- 
binations of  controls  that  enter  into  the  m lowest  frequency  modes  and  "new" 
controls . 

As  an  example  the  feed-support  structure  of  a spacecraft  antenna  is  consid- 
ered. It  is  modeled  as  a tetrahedron  made  up  of  flexible  bars  and  connected  to 
the  spacecraft  by  six  short  flexible  legs  containing  force  actuators  and  displace- 
ment seiasors.  Due  to  the  three-sided  symmetry  of  this  structure,  both  the 
symmetric  and  the  anti-symmetric  equations  of  motion  can  be  decoupled  into  two 
subsystems.  The  resulting  four  subsystems  are: 

(1)  Pitch/fore-af t motions  with  four  degrees  of  freedom  (DOF),  two 
controls,  and  one  output  (the  fore-aft  motion  of  the  feed). 

(2)  Vertical  motions  with  three  DOF,  one  control,  and  one  output 
(the  vertical  motion  of  the  feed). 

(3)  Roll/lateral  motions  with  four  DOF,  two  controls,  and  one 
output  (the  lateral  motions  of  the  feed). 

(4)  Yaw  motion  with  one  DOF,  one  control,  and  no  output  (the  feed 
does  not  move  during  yaw  motion). 

Such  a decomposition  obviously  simplifies  the  task  of  synthesizing  active 
control  logic  for  the  structure. 


INTRODUCTION 


The  example  was  suggested  by  the  ACOSS  (Active  Stabilization  of  Space 
Structures)  problem  posed  by  the  Draper  Laboratory  a fev7  years  ago  (see  Ref.  1 
and  Figure  1).  The  only  difference  in  the  data  from  that  problem  is  that  we  have 

* 

This  research  was  supported  under  NASA  Grant  NAG-1-236  from  the  Langley  Research 
Center,  monitored  by  Ray  Montgomery. 
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taken  the  three  vertical  bars  of  the  tetrahedron  to  be  identical.  This  results 
in  four  lateral  modes  that  have  identical  frequencies  to  four  longitudinal  modes 

The  ACOSS  problem  may  be  interpreted  as  the  problem  stated  in  the  abstract 
if  one  assumes  that  the  spacecraft  mass  is  large  compared  to  the  feed-tower  mass 
’ so  that  the  spacecraft  motions  are  negligible. 


EQUATIONS  OF  MOTION 


The  structural  model  is  oversimplified  and  was  apparently  only  intended  to 
be  an  example  of  a multi-input,  multi-output  system.  The  structure  consists  of 
twelve  bars,  six  of  which  (10  meters  long)  form  a regular  tetrahedron  with  the 
feed  at  its  apex  (point  1).  Two  bars  (each  2.828  meters  long)  connect  each  of 
the  three  joints  at  the  base  of  the  tetrahedron  (points  2,3,4)  to  the  antenna 
dish  on  the  spacecraft.  The  cross-section  area  of  the  three  base  members  is  ten 
times  the  cross-section  area  of  the  vertical  members  and  the  legs.  The  mass  of 
the  structure  is  "lumped"  at  the  four  joints  (equal  mass  at  each  joint) . 

Since  there  are  two  actuators  applying  forces  at  each  of  the  three  base 
joints,  it  is  possible  to  create  independent  horizontal  and  vertical  forces  at 
each  of  these  joints.  Let  (V£,IIj^)  be  the  (vertical,  horizontal)  forces  at  joint 
i (i = 2 to  4)  as  shown  in  Figure  2. 

» Let  k(i)  be  the  values  of  EA(i) /mL (i) (stiffness/mass)  for  the  base 

members  (i=l),  the  vertical  members  (i=2),  and  the  legs  (i  = 3),  where 
E=  Young's  modulus,  A=  cross-section  area  of  the  member,  L = length  of  member, 
and  m = lumped  mass  at  each  joint.  We  have  taken  m = 2,  and 


k(l)  = 50 

1 / 2 
1/sec  , 

k(2)  = 5 

1 / 2 
1/sec  , 

k(3)  = 17.6775 

1 / 2 
1/sec  . 

Using 

the  vectors  x = (x  ,y  , z ; x ,y 

'’^2’ 

f = (V„,H  ; 

V„,H  ; V, ,R,)',  the  equations  oi 

■ motion  may  then  be  written  as 

X = -K*x  + G‘f, 

where 

K = 12  by  12  matrix, 

G = 12  by  6 matrix. 

’ The  matrices  K and  G were  calculated  and  are  given  in  Appendix  A. 

DECOMPOSITION  INTO  SYMMETRIC  AND  ANTI-SYMMETRIC  MOTIONS 

Motions  of  the  feed-tower  can  be  decomposed  into  motions  that  are  either 
symmetric  or  anti-symmetric  with  respect  to  the  y-z  plane  of  S5mimetry.  The 
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decomposition  can  be  performed  by  the  following  change  of  variables: 


s 


- x^)/2. 


(Y2  + Z3)/2,y^,z^) , 


a = (x^,  (X2  + x^)/2,  (y^  - - Z2)/2,x^), 

S = ((H^  - ^\)n,  (V^  + V3>/2,V^j 
A = + K3)/2,  (V2  - V^)/2,II^j 


where  (s,a)  are  vectors  of  (symmetric,  anti-symmetric)  joint  displacements,  and 
(S,A)  are  vectors  of  (symmetric,  anti-S3anmetric)  actuator  forces.  The  equations 
of  motion  then  split  into  two  uncoupled  sets; 


and 


s = -KS*s  + GS*S, 
a = -KA-a  + GA-A. 

The  KS,  KA,  GS,  GA  matrices  were  calculated  and  are  given  in  Appendix  A. 


DECOUPLING  OF  TFiE  SYMMETPvIC  MOTIONS  INTO  PITCH/FORE-AFT  AND  VERTICAL  MOTIONS 


Using  an  eigenvalue/eigenvector  computer  code,  the  symmetric  equations  of 
motion  were  put  into  modal  form  (see  Appendix  B) . The  three  modes  of  this  system 
with  the  lowest  frequencies  correspond  to  quasi-rigid  pitching,  vertical  transla- 
tion, and  fore-aft  translation.  Linearly-independent  combinations  of  the 
symmetric  actuator  forces  control  each  of  these  quasi-rigid  modes,  so  we  chose  to 
regard  these  combinations  as  "new"  symmetric  controls.  When  this  was  done,  a 
further  decoupling  appeared,  in  that  two  of  the  four  higher-frequency  (deformation) 
modes  were  controllable  only  by  the  new  pitch  and  fore-aft  controls  (f  ,f^),  while 
the  other  two  modes  were  controllable  only  by  the  new  vertical  control'^(f  ). 
Furthermore,  fp  and  f^  control  only  the  fore-aft  motions  of  the  feed  (y.j^)  , 
while  f^  controls  only  the  vertical  motion  of  the  feed  (z^) . 

The  modes  that  involve  only  fore-aft  motions  of  the  feed  are  shown  in  Figure 
3.  The  pitch/fore-aft  equations  of  motion  in  modal  form  are  given  in  Figure  4. 
Figure  5 shows  the  combinations  of  the  primary  controls  that  form  f^  and  f^. 

The  modes  that  involve  only  vertical  motion  of  the  feed  are  shown  in  Figure 
6.  The  vertical  equations  of  motion  in  modal  form  are  given  in  Figure  7.  Also 
shown  in  Figure  7 is  the  combination  of  primary  controls  that  forms  f (equal 
values  of  ^ 

DECOUPLING  OF  THE  ANTI-SYMMETRIC  MOTIONS  INTO  ROLL/LATERAL  AND  YAW  MOTIONS 


A similar  procedure  was  follow^ed  for  the  anti-symmetric  motions  (see  Appendix 
B) . The  three  lowest  frequency  modes  vrere  quasi-rigid  roll,  lateral  translation. 
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and  yaw.  Here  the  two  higher  frequency  (deformation)  modes  were  controllable 

only  with  the  new  roll/lateral  actuator  forces  (f  .f-,). 

r 1 

The  modes  that  involve  only  roll/lateral  motions  of  the  feed  are  shown  in 
Figure  8.  The  roll/lateral  equations  of  motion  in  modal  form  are  given  in  Figure 
9.  Figure  10  shows  the  combinations  of  the  primary  controls  that  form  f^  and 


The  modes  that  involve  only  yaw  motion  of  the  feed  are  shown  in  Figure  11. 
The  yaw  equation  of  motion  in  modal  form  are  given  in  Figure  12.  Also  shown  in 
Figure  12  is  the  combination  of  primary  controls  that  forms  f (equal  values  of 

The  natural  frequencies  of  the  roll/lateral  subsystem  are  identical  to  those 
of  the  pltch/fore-af t subsystem,  which  is  not  surprising  in  view  of  the  symmetry 
of  the  structure. 


SUMMARY 


Figure  13  is  a summary  of  the  decoupling  of  the  system  into  four  subsystems. 
The  24th  order  system  with  six  controls  and  three  outputs  has  been  decoupled  into 
two  8th  order  subsystems  with  two  controls  and  one  output,  a 6th  order  system  with 
one  control  and  one  output,  and  a 2nd  order  system  with  one  control  and  no  output. 
This  greatly  simplifies  the  problem  of  synthesizing  active  control  logic  for  the 
system. 
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Figure  1.  Antenna-Feed  Tower 


Figure  2.  Control  Forces 


Figure  3.  Modes  that  Involve  Only  Y Motions  of  Apex 
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Figure  4.  Pitch/Fore-Aft  Subsystem 


MODE  m,  (PITCH)  CONTROLS  (fp) 


MODE  ni4(FWD.  TRANSL.)  CONTROLS  ( f p ) 


Figure  5.  Pitch/Fore-Aft  Controls 
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VJliere  is  Qua  si -Rig  id , (111^,111^2)  Deformation  modes, 
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Figure  8.  Modes  that  Involve  Only 
X Motions  of  Apex 
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Figure  9.  Roll/Lat'eral  Subsystem 
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MODE  mj  (ROLL)  CONTROLS  (fp) 


MOPE  itij  (LATTRANSL.)  CONTROLS  (fL) 


Figure  10.  Roll/Lateral  Controls 


Figure  11,  Antl-Symmetrlc  Mode  Involving 
no  Motion  of  Apex 
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l>There 


And 


mg  = -(4.204)^  mg  + .5773 


Figure  12.  Yaw  Subsystem 


Figure  13.  Summary  of  Uncoupled  Subsystems 
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APPENDIX  A.  EOUATIONS  OF  MOTION 


Let: 


x(l,k) 
u(i, j,k) 

k(i) 

where  i = 1 for 
legs,  and 


= kth  component  of  the  displacement  vector  of  the  ith  joint, 

= kth  component  of  nominal  unit  vector  from  ith  to  j th  joint, 
= kth  component  of  control  force  on  ith  joint 
= E"A(i)/ni*L(i) , 

= stiffness/mass, 

base  members,  i = 2 for  vertical  members,  and  i=3  for  the 


E = Young's  modulus, 

A = cross-section  area  of  bar, 
m = lumped  mass  at  each  joint, 
L = length  of  bar. 


Then  the  equations  of  motion  are: 


x(l,k)  = - k(2)-u(l,2,k)-u(l,2,j)*[x(l,J)  - x(2,j)] 

- k(2)-u(l,3,k)-u(l,3,j)-[x(l,j)  - x(3,j)] 

- k(2)-u(l,4,k)-u(l,^i,j)*[x(l,j)  - x(4,j)], 

x(2,k)  = - k(2)-u(2,l,k)-u(2,l,j)-[x(2,j)  - x(l,j)] 

- k(l)'u(2,3,k)*u(2,3,j)‘[x(2,j)  - x(3,j)] 

- k(l)*u(2,4,k)'u(2,4,j)*[x(2,j)  - x(4,j)] 

- k(3)-u(2,5,k)-u(2,5,j)*x(2,j) 

- k(3)*u(2,6,k)*u(2,6,j)*x(2,j) 

+ f(2,k). 


x(3,k)  = - k(2)‘u(3,l,k)'u(3,l,j)-(x(3,j)  - x(l,j)] 

- k(l)Mi(3,2,k)'u(3,2,j)'[x(3,j)  - x(2,j)] 

- k(l)-u(3,4,k)-u(3,4,j)-[x(3,j)  - x(4,j)] 

- k(3)*u(3,7,k)*u(3,7,j)*x(3,j) 

- k(3)'u(3,8,k)*u(3,8,j)-x(3,j) 

+ f(3,k). 


x(4,k)  = - k(2.)-u(4,l,k)*u(4,l,j)'[x(4,j)  - x<l,j)] 

- k(l)  •u(4 ,2  ,k)  •u(4 ,2,  j)  • [x(4  , j ) - x(2,j)] 

- k(l)-u(4,3,k)'u(4,3,j)'[x(4,j)  - x(3,j)] 

- k(3)-u(4,9,k)-u(4,9,j)'x(4, j) 

- k(3)-u(4,10,k)-u(4.10,j)-x(4,j) 

+ f(4,k), 

where 


1(2,1) 

= -s*ll2/ni. 

f(2,2)  = c'H^/m, 

f(2,3) 

= V^/m, 

f(3,l) 

= -s'll^/m. 

f(3,2)  = -c'U^/m, 

f(3,3) 

= V^/m, 

f(4,l) 

= U^/m, 

f(4,2)  = 0 

f(4,3) 

= V^/m, 

and 


We  have  taken  m==2  and: 


k(l) 

= 50 

1/sec 

k(2) 

= 5 

l/sec^ 

k(3) 

= 17.6775 

1 / 2 
1/scc 

A BASIC  computer  code,  TETRA,  was  developed  to  calculate  the  terms  In  the 
equations  above,  and  to  find  the  terms  in  the  symmetric  and  anti-symmetric  equa- 
tions of  motion.  It  is  listed  in  the  following  pages,  followed  by  the  output. 
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100  REM  TETRfl  4/26/82 

110  REM  FINDS  STIFFNESS  MATRIX  K,.  A 
^26  REM  CONTROL  HISTRIEUTION  MATRIX 
S0  REM  *■  G,.  FOR  TETRAHEDRON  ON  LEGS, 

140  REM  # WHERE 

150  REM  # XDOTnOT=-K>NX+G:i;IJ. 

160  REM 

170  REM  THEN  FINDS  K FIND  G FOR 
180  REM  # SVMMETRIC  4l  ANTI-SVMMETRIC 
190  REM  # MOTIONS. 

200  REM 
210  REM 

220  REM  * READS  DATA  ^ 

230  REM 

240  D I MX  C 1 0 , 3 ) , M ( 4 , 1 0 , 3 > , K C 12,1 2>  ••  FOR  I 1 TO  1 0 FOR  J=  1 T03  READX  < I , J ) ••  NEXT  J , I 
250  REHDKl,K2,KL:LEFFNR<S>~INTa0000*S+.5>/ 10000 
260  REM 

270  REM  $ FINDS  NOMINAL  DISPLACEMENT  VECTORS  FROM  FOUR  TETRAHEDRON  JOINTS 
280  REM  TO  EACH  OTHER  & ALONG  LEGS 
290  REM 

300  FOR  I = 1 T04 : FOR  J= 1 TO  1 0 ••  FORK= 1 T03 : M -1 1 , J , K ) =X  < I , K > -X  < J , K > : NEXTK , J , I 
310  REM 

320  REM  iK  FINDS  ELEMENTS  OF  STIFFNESS  MATRIX  * 

330  REM 

340  FORI=^l T03 : F0RJ=4T06 : K(I , J>=M(1 , 2, I >#M< 1 , 2, J-3)#K2 : NEXTJ, I 
350  FOR  I ==  1 T03 : FOR  J=7T09 : K < I , J ) =M  < 1 , 3 , 1 > ^iM  < 1 , 3 , J-6  > #K2 : NEXTJ , I 
360  FOR  I = 1 T03 : FOR  J=  1 0TO 1 2 : K C I , J > =M  <1,4,1 ) ^:M  <1,4 , J-9 ) :F:K2 : NEXTJ , I 
370  F0RI=4T06:F0RJ=7T09M«I,  j;:'=M<2,3, 1-3>»1<2,3,  J-6):^:K1  MIEXTJ,  I 
■••90  FORI=4TO6:FORJ=10TO12:K<I,  J>=M<2,4,I-3)*:M<2,4,  J-9>i+:Kl  MIEXTJ,  I 
^9d  F0RI=7T09  ^ FORJ=10TO12 : K< I , J)==M<3, 4, 1-6)$M<3, 4,  J-9>#K1  NEXTJ,  I 
400  FORI=l T03 : F0RJ=4T012 : K< J, I >=K< I , J) : NEXTJ, I 
410  F0RI=4T06:F0RJ=7T012:K<J,  I>=K<I,  J)  M-IEXTJ,  I 
420  FORI=--7TO9.'FORJ=10TO12M«J,I)=K<I,J>:NEXTJ,  I 

430  F0RI=lT03'F0RJ=lT03:K<I,J)=-K<I,J+3>-K<I,J+6)-K<I, J+9) :NEXTJ, I 
440  F0RI=4T0S  F0RJ=4T06  • K<  I , J)=-K<I , J-3;j--K<  I , J+3)-K<  I , J+6.^  NEXTJ,  I 
450  F0RI=4T06 : F0RJ=4T06 : K<  I , J)=K<  I , J>-M<2, 5, 1-3)^iM<2, 5,  J-3)#:L  : NEXTJ,  I 
460  F0RI=4T06N"0RJ=4T06:K<I,  J>=K<I,  J)-M<2,6, 1-3)«1<2, 6,  J-3:>*KL  : NEXTJ,  I 
470  F0RI=7T09 : F0RJ=7T09 : K< I , J>=-K< I , J-6>-K< I , J-3>-!<< I , J+3> : NEXTJ, I 
480  F0RI=7T09 : F0RJ=7T09 : K< I , J>=K< I,  J>-M<3, 7, 1-6>:FM(3, 7,  J-G).+iKL : NEXTJ,  I 
490  F0RI=7T09 : F0RJ=7T09 M« I , J)=K< I , J>-M<3, 8, 1-6)»K3, 8, J-6>*KL : NEXTJ, I 
500  FORI  = 10TO12'FORJ=10TO12:K<I, J>=-K<I, J-9>-K<I, J-6)~K<I, J-3)  HIEXTJ, I 
510  FORI  = 10TO12:FORJ=10TO12-K<I, J>=K<I, J>-M<4,9, 1-9)*M<4, 9,  J-9)*KL : NEXTJ, I 
520  FORI  = 10TO12:FORJ=10TO12;K<I,  J>=K<I,  J>-M<4,  IG,  I-9)$M<4, 10,  J-9;'#t(L  : NEXTJ,  I 
530  PRINTTAEK 10./:  "STIFFNESS/MASS  MATRIX,  UPPER  LEFT  QUADRANT" 

540  F0RI  = 1 T06 : PRINTTAEK  1 0) ; FORJ=l  T06 : PRI  NTFNR<K<  I , J > > : NEXTJ  ^ PRINT : NEXT  I 
550  PRINTTflE<10/:  "UPPER  RIGHT  QUADRANT"  : F0RI  = 1T06  : PRINTTRB< lO.'/:  ^F0RJ=7T012 
560  PRINTFNR<K<I, J>>; : NEXTJ : PRINT : NEXT I 
570  PRINTTRE<  10).;  "LOWER  RIGHT  QUADRANT" 

580  FOR  I =7T0 1 2 PR  I NTTAE  < 1 0> ; : FOR  J=7T01 2 ; PR  I NTFNR  <K  < I , J .■> : NEXTJ  : PR  I NT  : NEXT  I 
590  REM 

600  REM  CALCULATES  AHTI-SVMMETRIC,  SYMMETRIC  STIFFNESS  MATRICES  * 

610  REM 

620  DIMT<12, 12>,TK12, 12> , L<  12, 12> , LI < 12, 12)  : C=. 5 : T(  1 , 1 ) = 1 : T<2, 6>  = 1 
630  T<3,7)  = 1 :T<4,2)=1 : T<4, 8)  = 1 ; T<5, 3)=1 : T<5, 9>=1  : T<6,  4)^1  : T<6,  10)  = 1 ^ T<7..'2)  = 1 
10  T<7,8)=-1  •T<8,9.)  = 1 ••T<8,3)=-1  -TO,  10)==1  ■T<9,4.>=-1  ■■T(10,5)=-l  :T<11,  11>=-'1 
650  T<12, 12)  = 1 :TK1,  D = 1 : TI  <2, 4)=C:  TI  <2, 7)=C : TI  <3, 5)=C  ■ TI  <3, 3>-=-C  : TI  <4,  6)=C 
660  TK4,9)=--C:TI<5,10>  = 1 : TI <6, 2>  = 1 : TI  <7, 3>  = 1 ■ TI <8, 4)=C  : TI  (8, 7)^-C : TI  <9,  5)=C 
670  TI <9, S)=C : TI <10, 6)=C : TI < 10, 9)=C : T1 < 1 1 , 1 1 )=1 : TI <12, 12)=1 

680  f-ORI==lT012:FORJ=lTO12:FORK=lTO12:Ll<I,J>==LKI,  J>+K<I,K)#T<K,  J)  HIEXTK,  J,  I 
690  FOR  I = 1 TO  1 2 : FOR. r- 1 TO  1 2 : FORK=  1 TO  1 2 : L < I , J ) =L  < I , J > +T I < I , K ) $L  1 < K , J ) ^ NEXT’K , J , I 
700  PR  I NT  TAE  < 1 0 ) " ANT  I -SVMMETR I C ST  I PFNESS/MASS  MATR I X : " 

7 1 0 FOR  I = 1 T05  : PR  I NTTFlB  < 1 0 ) ; : FOR  J=1  T05  : PR  I NTFNR  < L < I , J ) ) : NEXTJ  : PR  I NT  : NEXT  I 
^20  GOTO750 

30  PRINTTAE<10).:  "CROSS-COUPLING  MATRIX:  " 

740  F0RI=1T05 : PRI NTTAE < 1 8) ; : F0RJ=6T012 : PRINTFHR<L< I , J) > ; : NEXTJ : PRINT : NEXT I 
750  PRINTTAE<10).:  "SVMMETRIC  STIFFNESS/MASS  MATRIX:  " 
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760  FOR  I ^=6T0 1 2 : PR  I NTTflB  <1 0 > J : F0RJ-6T0 1 2 ••  PR  I HTFHR  < L (I ..  J > ) ; : NEXT  J ; PR  I HT  NEXT  I 
I’’  I*"  0 GOT  0 6 C'  0 

780  PR  I NTTRB  < 1 0 > J " CROSS-COUPL I NG  MfiTR  I X " 

790  F0RI=^6T012 : PRINTTNIK 10) : FORJ^ITOS^ PRINTPHRCLCI.  J) > ; : NEXTJ : PRINT : NEXTI 
800  REM 

810  REM  « COMPUTES  GR  & OS  ♦ 

820  REM 

830  ri  I MG  ( 1 2 . 6 ) ..  G 1 ( 1 2 . 6 ) 

840  G<4.,  1)=^-.  25:  G(5..  1)  = . 43301  :G<6.2)=.5:GC7,3)=-.25:G';:8.3)=-.43301  :G';9.4>  = .5 
850  G'^  10.'  5)  = . 5 ■■  G';'  12..  6')~.  5 

860  F0RI==lT012:F0RJ=lT06:F0RK=lT012:Gia.  J>=61<I.J)+TKI.K)*G(K.J)  :NEXTK.  J.  I 
870  PRINTTflEae) : PRINT "fiNTI-SVMMETRI'C  CONTROL  DISTRIBUTION  MATRIX  GRa.-J),.  I 
880  FORI  = l T05 : PRINTTfllK  1 0) : FOR.J=l  T06 : PRINTFNRCGl  C I . J)  > ; : NEXTJ  : PRINT  : NEXTI 
890  FRINTTAB<10)  :PRINT"3VNMETRIC  CONTROL  lUSTRIBUTION  MATRIX  GSa,.J)..  IS:" 
908  FOR  I ==6T0 1 2 : PR  I NTTRE  < 1 0 ) ; : FOR  J=  1 T06 : PR  I NTFNR  <G  1 ( I ..  J ) ) : NEXTJ  : PR  I NT  : NEXT  I 
910  END 
920  REM 

930  REM  TETRfl  .'is 
940  REM 


950  REM  fiCOSS  TETRAHEDRON.; X=<X1 , VI . 21 , X2/t'2 
960  REM  U=';H2,.  V2.  H3..  V3.  H4..  V4> ; 4.-'26.-'82  Jfs 
970  REM 

'S0  REM  ENTER  DATA  HERE  ;<s 
...^0  REM 


V3.23..X4..V4.Z4) 


1000  REM  ENTER  X..'tUZ  COORDINATES  OF  JOINTS  1 THRU  10  * 

1010  DATA  0..0^  10.  165j-5.. -2.S87>2/5..-2.887..2..0..5.7735.. 2.. -6.. -1. 1547..0 
1020  DATA  -4 . -4 , 6 1 88 ..  0 ..  4 . -4 . 6 1 88 ..  0 ..  6 - 1 . 1 547 . 0 . 2 y 5 . 7735 ..  0 . -2 . 5 , 7735 ..  0 
1030  REM  if:  ENTER  STIFFNESS.-''MRSS  FOR  HEAVV  MEMBERS.  LIGHT  MEMBERS.  & LEGS 
1040  DATA  .5. .05.2.2097 
READV. 


.iN 
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STIFFNESS/MfiSS  MFiTRIX..  UPPER  LEFT  GUpnRflHT 
-2.5  0 0 1.25  .7218  2.8413 

0 -2 . 500 1 -2E-04  .7218  ,4167  1,1 786 

0 -2E-04  -10.0001  2.0413  1.1786  3.3334 

1.25  .7218  2,0413  -68. 1694  -14.7184  “2,0413 

.7218  .4167  1.1786  -14.7184  -51.1771  -1.1764 

2.0413  1,1786  3.3334-2.0413-1.1764-21.011 

UPPER  RIGHT  QURDRflNT 

1.25  — .i'’2i8  — 2.0413  0 0 0 

— ,721i3  .4167  1,1786  0 1,6667  -2,357 

-2,0413  1.1786  3.3334  0 -2.357  3.3334 

50  0 0 12.5  21.6513  8 

0 0 0 21.6513  37,5021  0 

0 0 0 0 0 0 
LOWER  RIGHT  QUflDRflNT 

-68.1694  14.7184  2,0413  12.5  -21,6513  8 

14.7184  -51.1771  -1,1764  -21,6513  37.5021  8 

2.0413  -1.1764  -21,011  0 0 0 

12,5  -21,6513  0 -42,6776  8 8 

-21.6513  37,5021  8 8 -76.6709  2.357 

0000  2.357  -21,011 

mi  I -SVMMETR I C ST I FFNESS/MflSS  MflTR I K : 

-2,5  2.5  1.4435  4.0825  0 

1.25  —18.1694  — 14.i''l84  — 2,8413  12,5 

.7218  -14.7184  -51.1771  -1,1764  21.6513 

2.0413  -2,0413  -1.1764  -21.011  8 

0 25  43.3025  8 -42.6776 

SVMMETR I C ST I FFNESS/MflSS  MflTR I / : 

-2.5001  -2E-04  1.4435  .8335  2.3572  1 . 6667  -2.357 

~2E~04  -10.06101  4.£i825  2.3572  6,6667  -2.357  3.3334 

.7218  2.0413  -118.1694  -14,7184  -2.0413  21.6513  8 

,4167  1,1786  -14,7184  -51.1771  -1,1764  37,5821  0 

1.1786  3.3334  -2.8413  -1.1764  -21.011  8 8 
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APPENDIX  B.  MODAL  FORMS 


An  eigenvalue/eigenvector  code  was  used  to  find  the  natural  frequencies  and 
mode  shapes  from  the  symmetric  and  anti-symmetric  equations  of  motion  determined 
in  Appendix  A.  The  results  are  shown  below  and  on  the  next  page. 
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I.  INTRODUCTION 


The  need  of  active  control  of  large  space  structures  is  well-known  hence 
needs  no  reiteration  here.  The  highly  complex  and  interactive  nature  of  the 
problem  gives  rise  to  numerous  difficulties  which  have  been  well  documented  by 
BALAS  (Ref.  1).  Due  to  the  inherently  high  order  of  the  structure  and  the 
fact  that  only  a finite  number  of  sensors  and  actuators  is  available,  the  use  of 
modern  optimal  control  theory  for  vibration  suppression  purposes  seems  unjusti- 
fied as  it  inevitably  encounters  the  problem  of  control  and  observation  spill- 
over. ilore  specifically,  active  control  of  low  frequency  modes  by  such 
techniques  causes  intermediate  or  high  frequency  modes  to  become  unstable,  as 
demonstrated  both  in  theory  and  in  experiment  (see  Refs.  2 and  3).  Sophisticated 
compensation  techniques  can  be  used  to  overcome  this  problem  but  nevertheless 
lead  to  further  complication.  Another  much  simpler  technique  is  the  use  of 
colocated  velocity  (rate)  feedback  control  (see  Refs.  4 and  5)  which  guarantee 
that  all  modes  remain  stable.  Spillover  still  exists  in  this  case  but  rather 
than  being  detrimental,  can  stabilize  the  uncontrolled  and  unmodelled  modes. 
However,  there  still  remains  a crucial  problem,  unfortunately  ignored  by  most 
researchers  in  this  field,  namely  the  interaction  of  actuator  dynamics  with  the 
structure.  It  is  thus  the  central  objective  of  this  paper  to  point  out  that 
actuator  dynamics,  if  not  properly  treated,  may  give  rise  to  instability.  We 
shall  also  suggest  a couple  of  ways  to  overcome  this.  The  analysis  will  be 
carried  out  in  the  context  of  colocated  feedback  control  though  it  can  be 
carried  out  in  terms  of  optimal  control  just  as  v^ell.  In  the  present  paper, 
we  shall  present  only  a brief  summary  of  the  concepts  and  theory  involved; 
interested  readers  may  like  to  refer  to  Refs.  6 and  7 for  further  details. 


II.  NATURE  OF  THE  PROBLEM 


A large  but 
back  control  can 


finite  order  (N)  vibrating  system  with  colocated  rate  feed- 
be  modelled  by  a second  order  equation: 


Mx  + Dx  + Kx  = -S^CSx 


*Professor  of  Applied  Mechanics 


**Graduate  Student 


[M  is  the  inertia  matrix  including  the  inertia  of  the  sensors  and  actuators] 
where  the  sensor/actuator  (S/A)  location  matrix  S is  defined  by 


fl  if  S/A  pair  i is  located  at  element  j 

(S)i-  = (2) 

0 otherwise 


Applying  the  appropriate  orthogonal  transformation 


x=H  (3) 

such  that  = I,  = V,  where  V is  in  general  non-diagonal 

but  diagonally  dominant  and  q is  diagonal  with  the  square  of  the  modal 
frequency  down  the  diagonal.  Eqn.  (1)  is  reduced  to 

'^  + Vt  + = -$"’^s'’^CS$4  = -BS  (4) 


Thus,  the  addition  of  feedback  simply  modifies  the  closed  loop  damping 
matrix  to 


V = V + B 


(5) 


It  is  easy  to  prove  that  (see  Refs.  6 and  4)  if  V and  o.  are  positive 
definite  matrix,  then  the  system  is  stable.  Furthermore,  if  the  elements  of 
V are  relatively  small  compared  to  iJ,  then  the  dynamical  behavior  of  the 
coupled  system  is  basically  dominated  by  the  diagonal  elements  of  f>.  This 
can  be  shown  easily  using  perturbation  theory.  If  we  rewrite  V as: 


t>  = eV* 


(6) 


where  V*  is  of  the  same  order  of  magnitude  as  and  e is  a representative 
small  parameter,  then  the  eigen  values  of  the  close  loop  system  is  given  as 
the  zeroes  of 
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( S + ed-|  1 S + (jJ-|  ) 


det 


ed 


21 


ed 


12 

* 

22' 


ed 


13 


ed 


2 2\ 

(s  + ed^^S  + (jj2  ) ^*^23 


IN 


'2N 


ed 


N2 


+ a, ^2)^ 


or 


n (s^  + d..s  + 0).^)  + O(e^)  = 0 
i = 1 ^ 


(7) 


where  lower  case  letters  denote  elements  of  corresponding  upper  case  matrix. 
Hence  to  1st  order  accuracy,  the  system  decoupled  into  N scalar  vibrating 
system  with  damping  dictated  by  the  diagonal  term  d.j^.  Subsequently  if  Nn 
S/A  pairs  are  available,  we  can  approximately  assign  the  closed  loop  poles  ^ 
arbitrarily.  One  way  of  doing  such  is  to  decide  on  what  the  closed  loop 
damping  of  the  first  N^  modes  are,  which  is  equivalent  to  prescribing  the 
first  diagonal  elements  of  the  modal  gain  matrix  B.  Thus  if  we  choose 
the  first  x N^  top  left  corner  block  of  B as 


, b.  >0  (8) 


fb. 


^11 


bo  0 


0 


Aj 


then  the  gain  matrix  C can  be  computed  uniquely  as 


C = 


(Js"^) 


-1 


B-|i  (Sf-j ) 


-1 


(9) 


where 


Na  _ N-Na 
= N [ T .|  ; $2  ^ 


(10) 


and  C is  positive  definite  by  virtue  of  the  positive  definiteness  of  B.|p 
the  modal  gain  matrix  B must  therefore  be  positive-semi-definite  also,  and 
it  takes  the  form 
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B = $'^S’*’CS$  = 


(11) 


(S$2)^(S$i)"^B^^(S$^)"^(S$2) J 


Note  in  passing  that  controllability  and  observability  of  the  system  are 
equivalent  in  this  case  by  virtue  of  the  colocation,  and  is  identical  to  the 
invertibility  of  the  matrix  (S$, ).  This  is  easily  achieved  by  placing  the  S/A 
pairs  away  from  the  nodes  of  the  controlled  modes.  In  the  space  structure  K 
is  only  positive  semi -definite.  However,  if  yT(D+B)y  is  positive  definite  for 
any  vector  y orthogonal  to  (D+B)Z=0  for  Z €N|^,  then  the  vibrational  modes 

are  Liapunov  asymptotic  stable. 

We  have  in  essence  furnished  a brief  review  of  the  theory  of  colocated 
rate  feedback  control.  Further  details  will  be  found  in  ref,  5.  VJe  shall  concern 
ourselves  more  with  the  interaction  of  actuator  dynamics.  In  practice,  the 
actuators  inevitably  possess  inertia  hence  cannot  be  regarded  as  memoryless 
devices.  This  leads  to  a finite  actuator  bandwidth.  BALAS  [ref.  8]  argues 
that  by  ignoring  the  fast  modes,  the  actuator  dynamics  can  be  ignored  if  it  is 
sufficiently  fast.  This  argument  is  questionable  since  firstly,  omittance 
of  fast  modes  is  not  justifiable  and  secondly,  no  matter  how  fast  the  actuator 
dynamics  are,  they  cannot  be  faster  than  all  the  system  modes  whose  bandwidth 
eventually  become  infinite.  This  is  one  of  the  many  fallacies  which  arises 
in  flexible  spacecraft  control  to  date,  in  that  we  are  using  finite  resources 
to  control  an  infinite  dimensional  system  and  hence  ignorance  or  truncation  of 
the  infinite  dimensionality  is  never  justifiable. 

With  the  presence  of  actuator  dynamics,  eqn,  (1)  is  modified  to  be: 

MX  + Dx  + Kx  = -S^Cz  (12) 

[z  " 


Applying  the  appropriate  canonical  transformation  and  letting 

z = S$n  , (.14) 

eqns,  (12)  and  (13)  are  equivalent  to 

K + vi  + - Bn  = -$'’^s'^cs$n 


I (S$2y^(S$-|)"’^Bi^ 


(15) 


(16) 


I„[n  + 63T,  + 

As  before,  if  the  damping  matrices  V and  8 are  relatively  small,  we  can 
regard,  to  first  order  accuracy,  the  coupled  system  as  N decoupled  scalar 
system.  This  is  a useful  concept  which  will  enable  us  to  investigate  instability 
via  classical  control  technique.  A typical  reduced  scalar  system  is  governed  by 


C,  + 6,  5,.  + -b,ri, 


07) 


n,  + Bg  tii  + »3(rii-£,)  = 0 , (18) 

For  rotational  convenience  we  shall  discard  the  index  i from  5-j  and  ni . 
Henceforth  the  symbols  ? and  n are  regarded  as  scalars  quantities  until  otherwise 
specified.  By  taking  Laplace  Transform,  the  combined  system-actuator  dynamics  can 
be  shown  easily  to  be  governed  by  the  characteristic  equation; 


(s^ 


+ B^.s  + 


a)?)(s^  + 3gS  + 


X K 2^ 
(^a)  + bi  o)gS 


(19) 


Using  conventional  Nyquist  diagram  methods,  it  can  be  shown  that,  for 
stability,  the  scalar  gain  b^.  has  to  be  upper  and  lower  bounded  by 


Y, 


(20) 


where 
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These  are  fairly  messy  expressions  but  nevertheless  can  be  slightly 
simplified  if  terms  in  O(c^)  are  neglected.  A typical  plot  of  these  bounds  as 
a function  of  w,  with  as  fixed  parameters  is  shown  in  Fig.  1.  For 

I ana 

non-trivial  the  shaded  .area  defines  the  stable  region,  i.e.,  for  any  fre- 
quency mode  of  frequency  wf  to  be  stable,  the  gain  (positive  or  negative)  is 
required  to  be  bounded  by 

. (23) 


For  trivial  the  stability  boundaries  are  shown  by  broken  lines,  part 
of  which  coincides  with  the  W:j  axis.  They  also  partition  the  stability  region 
[for  a system  of  non-trivial  into  four  sectors.  We  shall  defer  the  significance 
of  these  sectors  till  later  but  as  decreases,  the  stability  region  shrinks 
correspondingly.  It  is  now  easy  to  see  why  instability  can  occur.  Suppose 
in  the  process  of  designing  the  gain  matrix  C,  large  values  of  scalar  gain  b. 's 
occur  on  the  diagonal  of  B,  so  much  so  that  it  lies  outside  the  stability  boundary, 
then  the  corresponding  mode  will  result  in  instability.  Such  large  b.  can  result 
in  several  ways,  e.g.,  the  prescribed  damping  is  too  high,  the  matrix  ^S^, ) is 
near  singular  due  to  a poor  choice  of  S/A  location,  or  simply  because  the 
natural  damping  is  so  low  that  the  stability  region  is  not  large  enough  to  accom- 
modate even  reasonably  small  gain.  One  has  to  bear  in  mind  that  this  argument 
is  only  accurate  to  first  order  since  coupling  terms  have  been  ignored,  nevertheless 
it  works  pretty  well  even  in  actual  simulation.  It  is  also  obvious  from  Fig,  1 
that  those  modes  whose  frequencies  are  around  co  to  roughly  2w  are  most  vulnerable 
to  instability  as  the  stability  boundary  (y^)  is  relatively  low  in  this  range 
[we  shall  call  it  the  "critical  frequency  range"  henceforth].  For  zero  Cn  and 
positive  gain,  it  is  also  true  that  all  modes  with  frequency  greater  than  u 
are  unconditionally  unstable.  Fortunately  actual  c occurring  in  space  is  hot 
zero  even  though  it  may  be  very  small  indeed.  The  smaller  is,  the  larger  is 
the  critical  frequency  range  and  the  more  likely  instabil ity^is  to  occur.  In 
any  case  the  resulting  closed  loop  damping  of  all  higher  modes  will  be  lower  than 
their  natural  damping.  This  can  be  explained,  qualitatively  in  terms  of  root 
locus  diagram  but  instead  we  shall  provide  a more  quantitative  analysis. 

Suppose  we  equate  the  characteristic  equation  (19)  to  the  closed  form 
expression 


(s'-  + BpS  + o)p)(s^  + BpS  + i/)  = 0 (24) 

If  we  expand  both  eqns,  (19)  and  (24)  and  equate  coefficients  in  powers  of  s, 
four  algebraic  equations  will  result  which  enable  us  to  solve  for  the  four  unknown 
Ys  B-j  doq  with  B prescribed a-priorily.  After  some  algebra  it  can  be  shown 
that^the*^  gain  corresponding  to  a prescribed  damping  B is 

r 


Y 


:=  -B±j 


B^-4AC 
2A 


S'-  - 4AC  > 0 


(25) 
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where 


A = 

a 

(26) 

® f S^G,  - 2Gp 

(27) 

C = (BpG,  - G2)(S^G,-G2)  + (Bp 

o ^2  2 2 

- 3 ) to.  (jO 

q'  1 a 

(28) 

% = e,  B,  - Bp 

(29) 

2 2 

G,  = 0.1  + + 6i6p  - BpBp 

(30) 

Gj  :=  6i»2  h-  BpO.2 

(31) 

Since  wp  is  expected  to  be  close  to 


0). 


3 can  be  related  to  the  prescribed  damping 


for  sufficiently  low  gain,  the  damping 
ratio  5 approximately  by 


6.  = 2 


(32) 


The  locus  of  y vs  w,  for  fixed  ? are  plotted  in  Fig.  2.  For  Cp  = 
the  locus  coincide  with  the  stability’^boundaries.  For  ?p  = Cp,  the  locus  coin- 
cide with  the  stability  boundary  for  c =0  and  within  each  of  the  4 sectors 
divided  by  this  locus 

In  Sector  I , y > Cp  > 0 
In  Sector  II , y > 0,  Cp  < 0 
In  Sector  III,y<0,  <0 
In  Sector  IV,  y < 0,  Cp  > 0 

We  also  see  that  as  (jj,  gets  closer  to  co  , the  maximum  attainable  damping 
gets  smaller.  Furthermore  for  w,  > w , less“than  natural  damping  will  always 
result  if  positive  gain  is  used.  Eve^  if  negative  gain  is  used,  very  high  gain 
is  required  just  to  achieve  reasonable  low  damping. 

Now  that  sufficient  physical  insight  to  the  problem  has  been  gathered,  we 
shall  consider  ways  of  overcoming  these  difficulties. 
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III.  RATE  FEEDBACK  WITH  LEAD  COMPENSATION 


Lead  compensation  has  been  used  extensively  in  classical  control  theory  to 
increase  the  phase  margin  of  marginally  stable  system,  but  is  usually  applicable 
for  single-input-single-output  system  whereby  scalar  transfer  junction  is 
realizable.  In  our  multi -dimensional  case,  however,  scalar  lead  compensation 
is  generalized  to  vector  lead  compensation,  again  by  involving  the  argument 
that  the  N'^'^  order  coupled  system  can  be  approximated  by  N decoupled  scalar 
systems.  There  is  no  rigorous  theoretical  justification  to  such  but  we  only 
know  that  it  works  fairly  well  in  actual  simulation. 

The  concept  of  lead  compensation  as  applied  to  scalar  system  is  well-known 
hence  needs  no  elaboration  here.  In  particular  it  works  very  well  in  this  case 
because  of  the  highly  resonant  nature  of  the  system  (due  to  small  c ).  The 
increase  in  gain  by  compensation  only  leads  to  very  small  change  of”  cross-over 
frequency  but  the  corresponding  increase  in  phase  is  substantially  high  enough 
to  reverse  the  phase  margin  of  critical  modes  from  negative  to  positive.  A very 
detailed  discussion  of  this  can  be  found  in[6]by  means  of  Bode's  diagram  argument. 
We  shall  only  generalize  the  scalar  argument  into  the  multivariate  case.  With 
the  inclusion  of  a vector  lead  compensation  network,  the  combined  dynamics  are 
modified  to 


1 — 1 

: 

+ 

+ 

(33) 

!„^[n  + BjT, 

4.  2 n . 2 . 

^ “a”^  = ^N^  “a  P 

^A 

n,p  £R 

(34) 

I„  [T,p  + p- 

] = + (Sf^) 

T^,T2  €R,  T^  <T^ 

(35) 

Conceivably,  T,  and  Tp  can  be  generalized  to  be  positive  definite  matrices 

for  perhaps  better  performance  but  for  the  moment  we  shall  assume  them  to  be 

scalars.  The  choice  of  T,  and  T2  is  a debatable  art,  the  complete  behavior  of 

eqns.  (33)  - (35)  is  extremely  complicated;  the  best  we  can  do  is  to  guess  by 

scalar  inference.  Intuitively,  we  would  like  the  increas^  in  phase  shift  to 

start  from  oa,  = , hence  To  is  chosen  to  be  around  To  = — . The  choice  of  T, 

1 a ’ 2 2 1 

a 

is  related  to  the  size  of  the  increase  in  phase  shift,  the  larger  T,  is  the  more 
the  increase.  If  T,  is  too  small,  then  the  increase  in  gain  will  b4  too  high 
but  if  T,  is  too  large,  the  increase  in  phase  shift  will  not  be  sufficient  to 
create-j-positive  phase  margin  for  the  critical  modes.  As  a good  rule  of  thumb,  the 
ratio  ^ is  chosen  to  be  between  5 and  10. 

Tl 

In  section  V we  shall  demonstrate  by  simulation  the  feasibility  of  this 
technique. 
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IV.  POSITIVE  POSITION  FEEDBACK  WITH  TUNING  FILTERS 


As  before,  we  shall  look  at  the  concept  of  positive  position  feedback  from 
the  scalar  point  of  view.  Here  again  regard  ^ and  n as  scalar  representing  the 
states  of  the  system  and  the  actuator  respectively,  as  described  by 

^*+  3.jC  + " b.n)  = 0 b.>  0 (36) 

n + 3^ri  + to^(n  - 5)  = 0 (37) 

a a 

which  has  the  characteristic  equation 

(s^  + 3.S  + 01?)  (s^  + = 0 , (38) 

I I a a I I a 

It  is  trivial  to  show  that  a sufficient  and  necessary  condition  for  stabil ity 
in  this  case  is 


b.  < 1 (39) 

which  is  also  the  stability  boundary.  This  obviously  is  much  simpler  than  the 
stability  boundary  of  the  rate  feedback  case.  Furthermore,  it  has  the  nice 
property  that  it  is  independent  of  all  the  system  and  actuator's  dynamics  1 
To  examine  the  closed-loop  damping  characteristic  of  position  feedback,  we  com- 
pute the  eigen  values  of  eqn.  (38)  and  plot  the  real  part  [represent  damping]  of 
the  eigen  values  vs  to-,  for  fixed  b. , ^ and  Wg,  The  resulting  closed-loop 

damping  characteristic  is  as  shown^in  Fig. “3.  It  can  be  shown  [see  ref.  7]  that 
the  characteristic  consists  of  two  parts,  the  upper  and  the  lower  locus  which 
correspond  to  the  actuator  and  the  system,  respectively.  For  sufficiently  low 
gain,  the  system  locus  exhibits  a resonance  peak  to  the  left  of  co  , For  higher 
gain  the  two  locus  meet  at  a cusp  which  can  be  shown  to  be  at  approximately 

(co  7 1 - ^[?  + K //l-?^])-  Thus  for  modes  near  the  resonance  peak,  the 

clBsed-loopdampin|  can'^be  maBy  times  that  of  the  natural  damping  [the  lower  asymp- 
totes] while  modes  away  from  the  peak  only  have  slightly  higher  than  natural 
damping. 

It  is  such  a property  that  motivates  us  to  the  use  of  "Tuning  Filters". 
Basically  these  are  band-limited  electronic  devices  with  dynamics  similar  to 
that  of  the  actuator  but  with  a frequency  "tuned"  to  the  controlled  mode  frequency 
in  order  to  enhance  its  closed  loop  damping. 

Suppose  that  there  are  as  many  filters  as  controlled  modes,  then  the  damping 
and  frequency  of  each  filter  is  carefully  chosen  so  that  its  corresponding  con- 
trolled frequency  lies  near  the  resonance  peak  [or  the  cusp]. 
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The  overall  system  is  now  described  by 


System  Mx  + Dx  + Kx  = S z 


C401 


Filters  I^^Ey.  + 2?^  y.  + y.]  = cVhx,  i=l,-  N^  (41) 

'XI'l 

Actuator  z + B^z  + z = E C^.  [y^.  + 3^^  y^-  + (42) 

' N 

x€R'\;  y.,zeR  ^ 


where  tV^  cV^  = C. 


(43) 


is  the  gain  matrix  for  the  i^*^  filter  which  has  natural  frequency  and  damping 
ratio  the  same  for  all  filters, 

Eqn.  (42)  implies  that; 


N, 


J/2 


(i)  Lim  z = Z CO.  y. 

t-Ko  i=l 


(ii)  If  z(o)  = z(o)  = y^(o)  =y^-(o)  = 0 Vi 


z(t)  = 0)^ 

i=l  M ^ ^i 


(44) 


so  for  sufficiently  large  3a » z converges  to  the  RHS  of  (44)  rapidly  and  con- 
sequently the  actuator  dynamics  falls  out  of  the  picture  completely.  Equations 
(40)-(43)  just  reduce  to,  after  appropriate  canonical  transformation, 


N. 


System  K + t>k  + [E  u.  y.] 

i=l  ■^i  ^ ^ 


(45) 


Filter  I|^  [y.  + y.  + or  y^  = cV^  S$?,  i=l,-^,N 


(46) 


'A 


128 


or 


where  = cV^  S$  ^ (48) 

The  symmetrical  structure  now  enables  us  to  prove  for  conditional  stability 
as  in  the  following  theorem. 


Theorem  1 : The  combined  system-filters  dynamics  are  stable  if 


where 


- E (1  + £.)  8.  is  positive  definite 
i=l  ^ ^ 

8.  = $V  C.  S$ 


and  are  some  arbitrarily  small  positive  quantities. 


(49) 

(50) 
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Proof:  Define  the  Liapunov  function 


Nf  ^ 

+ 5:  yl  y.]  + I + E 

i=l  ^ ^ i=1 


r 2 T 

K/i^i 


2?^  eI  y.]] 


(51) 


Cauchy  Schwartz  inequality  implies  that 


“f 

25'  e{  y,  < (H-  e.)  cT  eT  E.C.^yTy. 


(52) 


hence 

V > i d'^i  + Z yl  y.]  + i E (1  +e.)B  )5] 

^ i=l  1 ^ i=1  1 T 


1 ^ 2 ^i  T 


Differentiating  eqn.  (51)  with  respect  to  time,  we  obtain 

Nf 

= 2^  0)  yly.s  0 

i=l  ■ ^ ^i  ^ ^ 

V H 0 iff  1 = 0,  y.  = 0 V. 


(53) 


(54) 


Thus  if  the  assumption  is  true,  V is  strictly  positive  with  V negative  semi- 
definite  and  hence  the  combined  dynamics  is  stable  by  invoking  the  well-known 
Liapunov  direct  method. 

It  can  be  shown  further  [see  ref.  7]  that  a both  sufficient  and  necessary 
condition  for  stability  is  that  the  matrix  P in  eqn.  (47)  be  positive  definite. 


To  specify  EXACT  damping  for  all  the  controlled  modes  will  result  in  solving 
an  unmanageably  large  set  of  simultaneous  non-linear  algebraic  equations.  For  not 
so  large  prescribed  damping,  however,  we  can  ignore  the  coupling  effects  from  the 
uncontrolled  modes  and  the  first  modes  can  be  reduced  to  N»  decoupled  scalar 
systems  with  small  perturbation  error.  Suppose  that  one  filter  is  available  for 
each  controlled  mode,  then  the  characteristic  equation  for  the  i^”  mode  is, approx- 
imately 

(s^  + 3..S  + 60?) (s^  + Bf_s  +4.)  - ^'4  4."  ° 
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Consequently  if  the  closed-rloop  damping  8^^  is  specified,  the  four  unknowns 

0)  ,0)  ,8  ,b.  can  be  computed  by  comparing  eqn.  (55)  with  the  closed-rloop  ex- 

^i  Pi  Pi  ^ 

pression 

(s^  + 8„  s + 03^  )(s^  + 8^  s + 0)^  ) = 0 (56 


After  some  algebra,  the  solutions  are^ 


Bq,  ' 6,  + - 6,, 

8l,  = ^ 8,6,.  - Bp. 6,. 


Gj  = 6,  + B,.m? 


“p.  - <®2,  - 8p.  8,.)/(6q.  - 6 ,) 


“q,  ° ' ®q,  " ®q,> 


b.  = 1 - 


VS 

(jJi 


Once  the  scalar  gain  b.  for  the  i controlled  mode  is  obtained,  the  cor- 
responding gain  matrix  C..  c3in  be  computed  from 


(s-k-,)' 


The  gains  for  the  uncontrolled  modes  will  subsequently  be  by-products  of 
these  b.j ' s . 
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Hitherto,  the  choice  of  and  has  remained  arbitrary.  However,  they 
can  be  chosen  such  that  the  following  iptimum  criteria  are  achieved. 

(i)  For  a fixed  prescribed  closed-loop  damping,  the  corresponding  scalar  gain 
is  to  be  as  small  as  possible. 

(ii)  The  design  should  be  robust  in  the  sense  that  slight  change  in  the 
modal  structure  will  not  cause  high  gain  to  occur  which  consequently  causes  in- 
stability. 

(iii)  If  instability  ever  occurred,  we  prefer  it  to  occur  in  the  filter 
[which  has  inherent  saturation  characteristics]  and  not  in  the  system.  In  other 
words,  crashworthiness  is  desired. 


The  first  criterion  can  be  easily  achieved  by  designing  and  such  that 

the  damping  for  a controlled  mode  be  exactly  on  the  cusp  of  tlie  damping  character- 
istic. However,  this  is  not  a very  safe  thing  to  do  since  slight  change  in  the 
controlled  mode  frequency  is  likely  to  cause  high  gain  (and  consequently  in- 
stability) to  occur.  Thus  the  first  and  second  criteria  are  unfortunately  in 
conflict.  A reasonable  compromise  is  to  place  the  damping  of  the  mode  20%  below 
the  cusp  to  allow  for  some  lateral  clearance.  The  third  criterion  can  be 
achieved  by  placing  the  cusp  slightly  to  the  left  of  the  controlled  frequencies. 
This  can  be  explained  very  simply  by  means  of  Root  Locus  diagram  as  shov*;n  in 
Fig.  4(i),(ii)  and  (iii). 


With  these  facts  in  mind,  a reasonably  "optimum”  choice  of  the  filter's 
characteristics  can  be  shown  to  be 


(1.02  c )^ 

^ rr  « - - - - 

^ 0.16  + (1.02  ^ f 

I P 


(65) 


Note  in  passing  that  even  though  b.  < 1 is  required  for  stability,  we  in 
general  want  to  keep  b.  to  be  less  than'about  0.8  to  avoid  the  degeneration  into 
2 real  roots  as  shown  In  Fig.  4,  because  then  one  root  will  be  less  stable  than 
the  other. 

Lastly,  we  have  so  far  assumed  that  a full  complement  of  filters  are  avail- 
able; this  is  in  general  not  necessary.  If  less  than  N»  filters  are  available,  we 
would  use  one  filter  each  for  the  lower  modes  since  they  are  more  vulnerable  to 
instability,  and  several  higher  controlled  modes  will  share  the  remaining  filters. 
The  damping  specification  still  works  out  the  same  except  that  now  we  expect 
slightly  poorer  performance  since  we  are  using  less  resources  to  do  more  work. 
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IV.  NUMERICAL  SIMULATION  AND  RESULTS 


For  illustration,  we  shall  consider  the  control  of  a simply  supported  dis- 
crete shear  beam  of  N elements,  The  corresponding  mass  and  stiffness  matrix 
take  the  form 


M = 

%^N 

(66) 

2 

^1  0 

0 •- 

N 

0 

-1 

2 -1 

0 

• 

1 

-1  2 

-1 

(67) 

1 

t 

4 

* 

-1 

0 - • 

-1 

2 

We  do  not  know  the  exact  form  of  the  damping  matrix  D but  do  know  that  the 
canonical  damping  matrix  V is  diagonally  dominant,  so  for  simulation  purpose  we 
shall  assume  that 


iV)..  2 w.  8.. 


(68) 


where  is  the  natural  frequencies  of  the  modes,  and  is  easily  shown  to  be 


sin 


TT1 


zTN+ry 


i=l,— N 


(69) 


with  corresponding  modal  matrix  given  by 


As  we  would  like  the  first  N„  modes  to  decay  at  a same  rate  (as  opposed  to 
having  the  same  damping  ratio)  we^shall  prescribe  an  approximate  closed-loop  damping 
ratio  of  t to  the  first  mode,  and  then  set  the  damping  of  all  controlled  modes 
to  be  the  same  as  the  first,  i.e.. 


2 


i=l 


-N, 


(71) 


For  comparison  purposes,  we  shall  carry  out  simulation  for  each  of  the  four 
following  cases.  In  each  case,  the  common  parameters  are  N=20,  N„  = 4,  S/A 
location  at  {3,8,13,19},  = 0.01,  Cp  = 0.3,  mQ=0.25. 

Case  1 : Colocated  rate  feedback,  no  actuator  dynamics  present,  feedback  gain  com- 

puted according  to  eqns.  (8)  and  (9).  The  closed-loop  damping  and  frequency  for 
each  mode  are  computed  from  the  eigen  values  of  the  system  in  eqn.  (4). 
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Case  2;  Colocated  rate  feedback  with  actuator  dynamics  given  by  Wg  = 1.0, 

= 0.7;  No  compensation;  The  feedback  gains  are  the  same  as  in  Case  1;  the 
closed-loop  damping  and  frequency  are  computed  from  the  eigen  values  of  the  system 
in  eqns.  (15)-(16). 

Case  3:  Colocated  rate  feedback  with  same  actuator  dynamics  as  Case  2;  com- 
pensation with  T,  = 0.2,  and  T2  = 1.0  is  included;  the  feedback  gains  are  the 
same  as  in  Case  1 and  2;  the  ciosed-loop  damping  and  frequency  are  computed 
from  the  eigen  values  of  the  system  in  eqns.  (33) -(35). 

Case  4.  Colocated  positive  position  feedback  with  actuator  dynamics  eliminated, 

4 tuning  filters  are  used  to  control  first  4 modes;  the  filter's  characteristics 
are  computed  from  equations  (64)-(65);  the  closed-loop  damping  and  frequency  are 
computed  from  the  eigen  values  of  the  system  in  eqns,  (45) -(46). 

The  outcome  of  these  simulations  are  tabulated  in  Tables  1 and  2. 

The  outcome  of  the  simulations  can  be  summarized  by  the  following  points: 

(1)  In  Case  1 where  actuator  dynamics  are  absent,  colocated  rate  feedback 
control  works  very  well  and  the  controlled  modes  closed-^loop  damping  is  fairly 
close  to  the  prescribed  value.  Furthermore,  all  uncontrolled  [and  expectedly,  un- 
modelled] modes  result  in  higher  than  natural  damping  of  1%. 

(2)  In  Case  2 where  actuator  dynamics  are  included,  we  found  that  the  4th, 
5th  and  6th  modes  are  unstable.  If  the  scalar  gains  for  these  modes  are  plotted 
onto  Fig,  1,  they  are  found  to  lie  outside  the  stability  region,  Hence,  this 
justifies  the  fact  that  we  can  predict  stability  of  a higher  order  coupled 
system  just  by  looking  at  the  diagonal  elements  of  3»  at  least  up  to  first 
order  accuracy  anyway.  Furthermore,  the  uncontrolled  modes  are  found  to  have 
less  than  natural  closed-loop  damping,  again  consistent  with  our  previous  theory. 

(3)  In  Case  3,  a lead  compensation  network  is  used  to  raise  the  phase 
margins  of  the  unstable  modes  and  it  was  found  that  all  modes  which  are  previously 
unstable  are  now  stable.  Furthermore,  the  controlled  modes  also  result  in  sub- 
stantially higher  damping  than  the  uncompensated  case.  Unfortunately,  this  is 
upset  by  the  decrease  in  damping  of  the  higher  uncontrolled  modes, 

(4)  In  Case  4 where  positive  position  feedback  with  4 tuning  filters  is 
used,  the  closed-loop  performance  surpasses  the  previous  case  significantly. 

The  closed-loop  damping  of  the  first  mode  is  slightly  less  than  specified 
[within  6%  anyway]  but  all  4 controlled  modes  have  fairly  uniform  damping, 

Better  still,  all  uncontrolled  [and  expectedly,  unmodelled]  modes  result  in 
higher  than  natural  damping. 

In  addition  to  what  has  been  observed,  position  feedback  has  several 
advantages  over  rate  feedback  as  summarized  below? 
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Rate  Feedback  with  Compensation 


Position  Feedback  with  Tuning  Filters 


1 . No  theorem  can  be  proven  to  guarantee 
global  stability. 

2.  Stability  is  highly  dependent  on  the 
natural  damping  of  the  structure, 
which  remains  quite  uncertain. 


3.  Maximum  attainable  closed-loop  damp- 
ing is  limited  depending  on  the  stabi- 
lity boundary. 


4.  Uncontrolled  and  unmodelled  modes 
result  in  lower  than  natural  damping. 

5.  Instability,  if  ever  exists, occurs 

in  the  intermediate  uncontrolled  modes. 

6.  Requires  rate  sensing, 

7.  Requires  accurate  knowledge  of  struc- 
ture than  is  available  in  practice. 


Theorem  pertaining  to  conditional 
global  stability  can  be  proven. 

Stability  is  almost  independent  of  any 
small  natural  damping,  thus  a priori 
design  of  the  control  needs  no  knowledge 
of  the  natural  damping. 

Maximum  attainable  damping  is,  at  least 
in  principle,  much  higher.  The  resonance 
peak  of  the  filters  damping  character- 
istics is  responsible  for  this. 

Uncontrolled  and  unmodelled  modes  result 
in  higher  than  natural  damping. 

Instability  can  only  occur  in  ill -designed 
filters. 

Requires  position  (relative)  sensing. 

Only  accurate  knowledge  required  is  the 
rigid  body  mode,  which  is  nevertheless 
trivial . 


V.  CONCLUDING  REMARKS 


In  retrospect,  the  problem  of  vibration  suppression  in  large  space  struc- 
ture has  been  a complex  one,  in  view  of  the  highly  complicated  and  interactive 
structure  and  the  vast  number  of  design  parameters  to  be  decided  upon.  In 
this  paper,  we  specifically  address  the  problem  of  potential  instability  caused 
by  actuator  dynamics  and  two  mys  of  overcoming  it.  Due  to  restriction  in 
space  allocation,  we  can  only  present  a brief  outline  of  the  theory  but  any 
missing  details  can  be  found  in  refs.  6 and  7.  It  is  hoped  that  this  paper  will 
stimulate  future  research  to  improve  existing  techniques  and  dig  out  finer  details 
which  we  may  have  unfortunately  neglected. 
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TABLE  I 


1 

Open  loop 
freq. 

Scalar 

gain 

R 

Case  1 

io  Actuator 
Dynamics 

Case  2 

Actuator  Dynamics 

0)  =1 .0  ? =0.7 

a a 

D . . 
1 1 

C-L 

freq. 

C-L 

d%) 

mmn 

mm 

same  for  a' 

1 4 cases 

1 

0.299 

0.1794 

-0.0944 

0.287 

31.28 

-0.1105 

0.345 

30.51 

2 

0.596 

0.1794 

-0.0977 

0.597 

16.16 

-0.0468 

0.673 

6.94 

3 

0.890 

0.1794 

-0.1001 

0.902 

11.03 

-0.0127 

0.951 

1.33 

4 

1.179 

0.1794 

-0.1020 

1.179 

8.62 

+0.00158 

1.227 

-0.13** 

5 

1.461 

0.1816 

-0.1085 

1.462 

7.40 

+0.00355 

1.494 

-0.24** 

6 

1.736 

0.2616 

-0.1612 

1.732 

9.27 

+0.00789 

1.766 

-0.45** 

7 

2.000 

0.2203 

-0.1307 

1.989 

6.56 

-0.0003 

2.018 

0.02 

8 

2.253 

0.0692 

-0.0544 

2.247 

2.42 

-0.0169 

2.251 

0.75 

9 

2.494 

0.2150 

-0.1376 

2.488 

5.52 

-0.0107 

2.503 

0.43 

10 

2.721 

0.1312 

-0.0932 

2.721 

3.42 

-0.0196 

2.725 

0.72 

11 

2.932 

0.1245 

-0.0931 

2.924 

3.18 

-0.0229 

2.935 

0.78 

12 

3.127 

0.1336 

-0.0996 

3.129 

3.18 

-0.0252 

3.130 

0.80 

13 

3.305 

0.0735 

-0.0635 

3.314 

1.92 

-0.0301 

3.306 

0.91 

14 

3.464 

0.0244 

-0.2215 

3.485 

6.35 

-0.0254 

3.468 

0.73 

15 

3.604 

0.0220 

-0.1309 

3.551 

3.68 

-0.0281 

3.607 

0.78 

16 

3.723 

0.1702 

-0.1272 

3.708 

3.43 

-0.0315 

3.726 

0.85 

17 

3.822 

0.3004 

-0.1823 

3.779 

4.82 

-0.0285 

3.826 

0.75 

18 

3.900 

0.1696 

-0.0880 

3.853 

2.28 

-0.0336 

3.901 

0.86 

19 

3.955 

0.1493 

-0.1012 

3.914 

2.59 

-0.0350 

3.957 

0.88 

20 

3.989 

0.1560 

-0.0962 

3.963 

2.43 

-0.0352 

3.990 

0.88 

Actuator  1 

■ -0.6367 

0.518 

77.55 

**  lln<;tahlp  tnndp 

Actuator  2 

-0.7310 

0.476 

83.78 

Actuator  3 

-0.7208 

0.534 

80.33 

in  all  cases 

Actuator  4 

-0.7263 

0.625 

75.78 
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Table  2 


Case  3,  Compensated  Rate 

Feedback 

Case  4,  Positive  Position  Feedback 

Tp=1.0,  T,= 
are  -5.202, 

0.2,  Compensator  roots 
-5.141,  -5.135,  -5.079, 

5^=0.6076 

= 0.3430 

=0.369 

" 

b^  = 0.2288 

o)f-2=0. 736 

b3  = 0.1  639 

(0^3=1  .099 

b^  = 0.1255 

o).f.^=l  .455 

C-L 

c-r 

C-L 

wmaamm 

damping 

frequency 

(/I 

O -0.5572 

0.676 

63.58 

t -0.1396 

0.249 

48.85 

« -0.6530 

0.651 

70.82 

^ -0.3565 

0.508 

57.47 

■|j  -0.5892 

0.585 

70.94 

-0.5523 

0.746 

59.50 

< -0.5799 

0.604 

69.25 

^ -0.7972, 

1.093 

58.92 

Mode 

1 -0.1111 

0.303 

34.45 

-0.0875 

0.296 

28.389 

2 -0.0978 

0.660 

14.65 

-0.0931 

0.584 

15.742 

3 -0.0505 

0.964 

5.23 

-0.0964 

0.883 

10.854 

4 -0.0359 

1.252 

2.86 

-0.0918 

1;158 

7.898 

5 -0.0226 

1.523 

1.49 

-0.0365 

1.473 

2.478 

6 -0.0164 

1.809 

0.91 

-0.0220 

1.742 

1.263 

7 -0.0136 

2.055 

0.66 

-0.0218 

2.001 

1.091 

8 -0.0190 

2.269 

0.84 

-0.0242 

2.255 

1.072 

9 -0.0143 

2.533 

0.57 

-0.0260 

2.495 

1.042 

10  -0.0202 

2.742 

0.74 

-0.0279 

2.722 

1.024 

11  -0.0221 

2.950 

0.75 

-0.0302 

2.934 

1.030 

12  -0.0239 

3.144 

0.76 

-0.0336 

3.131 

1.073 

13  -0.0294 

3.313 

0.89 

-0.0334 

3.306 

1.010 

14  -0.0220 

3.490 

0.63 

-0.0350 

3.465 

1.011 

15  -0.0231 

3.627 

0.64 

-0.0364 

3.604 

1.010 

16  -0.0277 

3.740 

0.74 

-0.0377 

3.725 

1.013 

17  -0.0202 

3.851 

0.53 

-0.0387 

3.823 

1.013 

18  -0.0271 

3.916 

0.69 

-0.0394 

3.901 

1.010 

19  -0.0302 

3.969 

0.76 

-0.0398 

3.956 

1.006 

20  -0.0293 

4.003 

0.73 

-0.0403 

3.990 

1.010 

138 


SCALAR 
GAIN  b; 


7u(cOi) 


STABLE  REGION 


fn  ^ 0 

Figure  1.  Stability  boundary  for  scalar  rate  feedback  system. 
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Figure  2.  Closed-loop  gain  characteristic  for  scalar  rate  feedback  system. 
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S€R 


N 

NxN 

NxN 

NxN 

NxN 


s 


u^i 

>Y 


®1  ' 


3,  = 2?  00 

a a 


zei  " 

'rpN 

n€R 


number  of  modelled  system  states 

number  of  sensor/actuator  pair  = number  of  controlled  modes 

system  state  vector 

system  mass  matrix 

system  damping  matrix 

system  stiffness  matrix 

sensor/actuator  location  matrix 

feedback  gain  matrix 

Canonical  (modal)  state  vector 

Canonical  (orthogonal)  transformation  or  modal  matrix 

Canonical  damping  matrix 

Canonical  stiffness  matrix 

modal  gain  matrix 

small  scalar  parameters 

null  space  of  matrix  K 

Laplace  transform  domain  variable 

natural  frequency  of  the  i^*^  mode 

scalar  gain,  diagonal  elements  of  B 

system  natural  damping  ratio 

twice  the  open  loop  damping  of  the  i^*^  mode 

actuator  damping  ratio 

twice  the  open  loop  damping  of  the  actuator 
actuator  natural  frequency 
actuator  state  vector 
Canonical  actuator  state  vector 
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prescribed  (approximate)  damping  ratio 

lead  compensator  time  constants 

tuning  filter  damping  ratio 

natural  frequency  of  the  i^*^  tuning  filter 

i^*^  eigen  value  of  relevant  systems 

Liapunov  function 

time  derivatives  of  ( • ) 


CONTROL  OF  ANTENNA-FEED  ATTITUDE  AND 
REFLECTOR  VIBRATIONS  IN  LARGE  SPACEBORNE 
ANTENNAS  BY  MECHANICAL  DECOUPLING  AND 

MOVABLE  DAMPERS 

P.K.C.  Wang*,  Jet  Propulsion  Laboratory,  California  Institute  of  Technology, 
Pasadena,  CA  91109;  E.C.  Hong  and  J.S.  Sarina**,  Department  of  System  Science, 
University  of  California,  Los  Angeles,  Los  Angeles,  CA  90024 


ABSTRACT 

Simple,  practical  methods  for  damping  reflector  vibrations  and  for  design- 
ing antenna-feed  attitude  control  systems  in  large  deployable  spaceborne  anten^ 
nas  are  proposed.  The  former  involves  a movable  damper  which  is  positioned  so 
that  the  rate-of-change  of  total  vibrational  energy  is  minimized.  The  latter 
introduces  a mechanical  decoupler  between  the  flexible  boom  and  the  antenna-feed, 
whereby  the  feed-attitude  control  system  can  be  designed  independent  of  boom 
dynamics.  The  validity  of  these  approaches  are  substantiated  by  analytical 
studies,  computer  simulation,  and  experimental  studies. 


I.  INTRODUCTION 


In  the  design  of  deployable  spaceborne  antennas  with  large  flexible  dish 
reflectors  and  long  flexible  feed-support  booms  (see  Fig.l),  it  is  of  importance 
to  quickly  damp  out  the  dish  vibrations  Induced  by  external  disturbances  and/or 
spacecraft  motions,  and  to  accurately  control  the  attitude  of  the  antenna-feed 
with  respect  to  a specified  reference  frame  [l]-[4].  Here,  we  propose  practical 
design  methods  which  lead  to  simple  implementable  feedback  systems  for  control- 
ling the  dish  vibrations  and  antenna-feed  attitude.  For  clarity,  we  shall  use 
simple  mathematical  models  to  illustrate  the  basic  ideas  and  the  detailed  deve- 
lopment and  justification  of  the  mathematical  results  will  be  omitted. 


II.  DISH-REFLECTOR  VIBRATION  CONTROL  BY  MEANS  OF  MOVABLE  DAMPERS 

Consider  a circular  dish  whose  vibratory  motions  about  a given  static  equi- 
librium configuration  (for  example,  a parabolic  cross  sectional  profile)  is  de- 
scrlbable  by  the  following  wave  equation  in  polar  coordinates: 

’'Consultant , Jet  Propulsion  Laboratory. 

**E.C.Hong  and  J.S. Sarina  contributed  to  Sections  III  and  II  respectively. 
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(1) 


p = (Tw^)_  + Tr  + r (r  ^TWg)^,  + f 


tt 


r r 


defined  on  the  spatial  domain  Q = {(r,6):  0 < 0 < 27T,  0<r.  <r<r}as  shown  in 
Fig. 2,  where  w is  the  displacement  about  the  static  equilibrium; °p  = p(r,6)  is 
the  mass  density;  T = T(r,0)  is  the  tension  and  f corresponds  to  the  control  or 
a damping  force.  The  lettered  subscripts  denote  partial  differentiation. 

Assuming  that  the  dish  is  clamped  at  the  inner  and  outer  rims,  the  boundary 
conditions  are; 


w(t,r  ,0)  = w(t,r.  ,0)  = 0 for  all  t and  O<0  < 2tt. 


in 


In  addition  to  (2),  we  have  the  periodicity  requirement: 


w(t,r,0)  = w(t,r,27T) 


for  r . < r < r . 

in  o 


The  total  energy  of  the  dish  at  any  time  t is  given  by 
,2vi  ^ r 


g(t)  = ^ 


ni- 

; 

in 


{p|w^p  + T(|w^ 


+ r w^ 


■)  } r dr  d0 . 


(2) 


(3) 


(4) 


Using  (l)-(3)  and  integration  by  parts,  it  can  be  readily  verified  that  the  time 
rate-of-change  of  energy  is 


g(t)  = 


- 2tt  - r 

/ /■ 

^in 


f (t,r,0)w^(t,r,0)  rdrd0. 


(5) 


A possible  approach  to  vibration  damping  is  to ^choose  the  control  or  damping 
force  f in  a given  admissible  class  such  that  S(t)  is  minimized  [5], [6].  Con- 
sequently, the  vibrational  energy  §(t)  is  reduced  as  quickly  as  possible.  A 
simple  choice  for  f is  a spatially  distributed  feedback  control  in  the  form  of 
a linear  damping  force  given  by 


f(t,r,0)  = -g(t,r,0)w^(t,r,0) : 


(6) 


where  g is  a positive  damping  coefficient.  Unfortunately,  such  a distributed 
control  or  damper  cannot  be  readily  implemented.  Therefore,  we  shall  consider 
a more  restricted  form  of  (6)  which  is  amenable  to  physical  implementation. 

Let  Q.  (t)  be  a proper  subset  of  Q,  denoting  the  effective  region  for  the  con- 
trol at  time  t.  We  assume  that 


-g w ^(t,r,0) 


f(t,r,0)  = 


on  9.  (t) , 


on  (t)  , 

c 


(7) 


144 


where  g Is  a specified  positive  real  number.  Let  li.  ^ be  the  set  of  all  admissi- 
ble SI  's.  We‘-  wish  to  find  a Si  (t)  G XL  , such  that  ^ 
c c ad 


fi(t) 


f 


f (t,r,0)w^(t,r,0)  d Si  = 


-/ 

\(t) 


g|w^(t,r,0) 


dSi 


(8) 


takes  on  its  minimum  value.  This  problem  is  analogous  to  that  of  silencing  a 
large  drum  as  quickly  as  possible  by  means  of  movable  damping  pads  which  are 
effective  only  over  certain  portions  of  the  drum  surface. 

In  what  follows,  we  shall  consider  a special  form  of  movable  damper  which 
is  suitable  for  the  dish  reflector  (see  Fig. 3).  Let  Si  = {(r,0):  0<r.  <r<r  : 
O^0-^0q<2tt}  , where  Bg  is  a given  aperture  angle.  We  define  the 

effective  region  Si^(6(t))  for  the  control  (3)  as  a rotation  of  Si^^  given  by 


Si  (6(t)) 

c 


{(r,0):  0<r,  <r<r  ; 

in  o 


6(t)  <04  0 +6(t)}. 
o 


(9) 


The  rotation  angle  6(t)  is  to  be  chosen  such  that  §(t)  given  by  (8)  is  minimized. 
Let  6*(t)  denote  an  optimum  angle,  then 


£;(t,6'^(t)) 


/ 


9^+5” (t) 


* 

6 (t) 


g]w^(t,r,6) I ^r  dr  d0. 


(10) 


Evidently,  6 (t)  corresponds  to  a rotation  angle  at  which  the  average  kinetic 
energy  over  the  effective  control  region  is  maximized.  It  is  of  interest  to 
obtain  estimates  of  the  total  energy  decay  of  the  dish  with  the  foregoing  opti- 
mized movable  damper.  Unfortunately,  useful  £i  priori  estimates  of  the  energy 
decay  are  not  readily  obtainable  directly  from  (l)-(3)  and  the  optimized  control 
(7).  Therefore,  we  shall  resort  to  computer  simulation  at  this  point. 


Consider  a dish  with  constant  p and  T.  In  this  case,  we  can  express  the 
solutions  of  (l)-(3)  with  the  optimized  movable  damper  in  the  form: 


w(t ,r ,0) 


CO 

Z 


a 

mn 


(t)  4>^^(r,0), 


(11) 


where  (j)  is  the  orthonormalized  eigenfunction  of  the  Laplacian  operator  with 
boundary  conditions  (2)  and  (3)  corresponding  to  the  eigenvalue  -A  given  by 
the  n-th  root  of  the  equation: 


J_^(X  ti^/Y)  Y^(A  r^/Y)  = r^Y)  Y^(A  r .^Y  ) , m=l,2,...,  (12) 

where  = T/p>0;  J and  Y denote  the  m-th  order  Bessel  functions  of  the  first 
and  second  kind  respectively. 
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It  can  be  verified  that  the  modal  coefficients  a (t)  satisfy  the  following 
countably  infinite  dimensional  system  of  ordinary  differential  equations: 


-1 


a (t)  + gp  A (t)a  (t)  + A a (t) 
mn  m mn  mn  mn 


= -gp 


00  ^ 

‘ II 


k,£=l 


i 


(t))0j^(0+6  (t))d6 


in 


m,n=l,2, , . . , 


(13) 


where 


(p  (r,0)  = R (r)0  (0) , 
mn  mn  m 


R (r)  = A {J  (A  r/Y)-[J  (A  r.  /y) /Y  (A  r.  /y)]Y  (A  r/y)}, 
mn  mn  m mn  m mn  in  ' m mn  in  ^ m^  mn 


0 (0)  = (it)  2^03  (m0  + ip  ) , 
m m 


A 

mn 


^/[r  p(r  ) 
o o 


r . 

in 


p(ri„)l. 


(14) 


p(r) 

A (t) 
in 


= J ^ (A  r/y)-[J  (A  r/y)/Y  (A  r/y)]Y  ^ (A  r/y), 
m-1  mn  ' m'  mn  ' ' m mn  ' m-1  mn  ^ 


i 


+6*(t) 


3*C 

6 (t) 


|0^(0)Pd6 


= { 0^  - m sin^mO^  sin[2(m6  (t)  + f^^j)  ] 


-1 


+ (2m)  sin(2m0  ) [1  - 2 sin^(m6*(t)  + ijj  )]}. 


' m 


It  is  evident  that  for  large  m,  the  damping  coefficient  gp  ^A^(t)  tends  to 

g0  / (2ttp)  as  m 
o 

To  gain  some  idea  on  the  performance  of  the  optimized  movable  damper,  com- 
puter simulations  are  made  yslng  truncated  versions  of  (13)  with  additional  small 
residual  damping  terms  g p”  a (t) , where  g = 0.001.  The  numerical  values  for 
the  dish  parameters  are: 

p = 0.05  kg./m^,  T = 8.9  kg./sec^,  r^^  = 1 m. , t^=51  m. , and  g = 0.1. 

Fig. 4 shows  the  energy  decay  for  various  Initial  kinetic  energy  distributions 
and  different  forms  of  movable  dampers  including  a damper  sweeping  at  a constant 
rate  of  20  /sec. , a randomly  positioned  damper  and  the  optimally  positioned 
damper  as  discussed  earlier.  In  the  computer  simulations,  the  dish  is  parti- 
tioned into  36  10°-sectors.  The  optimal  damper  position  is  determined  by  locat- 
ing a sector  with  the  highest  kinetic  energy.  In  the  case  of  the  randomly 
positioned  damper,  the  damper  position  at  any  time  is  determined  by  a random 
number  generator.  From  the  numerical  results,  it  is  apparent  that  in  all  cases. 
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the  total  modal  energy  (first  25  modes)  decays  monotonically  with  time.  As  ex- 
pected, the.  fastest  energy  decay  is  achieved  by  the  optimally  positioned  damper. 
However,  it  can  be  seen  from  Fig. 4c  that  the  effectiveness  of  the  optimally  posi- 
tioned damper  decreases  when  all  the  vibrational  energy  is  concentrated  in  the 
high-frequency  modes.  This  is  consistent  with  the  fact  that  the  damping  coef- 
ficient gp“^A^(t)  g6^/(27rp)  as  m -S'  “ as  mentioned  earlier. 

Although  the  foregoing  results  are  based  on  a highly  simplified  model  for 
the  dish,  the  proposed  approach  may  also  be  applied  to  more  complex  realistic 
situations  where  the  dish  is  constructed  from  elastic  ribs  covered  with  thin 
flexible  material.  In  this  case,  optimally  positioned  patch  dampers  may  be 
more  suitable.  Finally,  the  movable  dampers  proposed  here  can  be  implemented 
either  by  passive  dampers  in  contact  with  the  dish  surface  or  by  active  velocity 
feedback  controls.  However,  passive  dampers  are  preferred  from  the  reliability 
standpoint,  since  there  is  no  danger  of  pumping  energy  into  the  dish  in  case  of 
controller  failure. 


III.  ANTENNA-FEED  ATTITUDE  CONTROL  BY  MECHANICAL  DECOUPLING 


For  a typical  spaceborne  antenna  shown  in  Flg.l,  the  antenna-feed  is  rigidly 
attached  to  the  tip  of  a flexible  boom  which  could  undergo  both  torsional  and 
bending  vibrations.  It  is  required  to  control  the  antenna-feed  attitude  with 
respect  to  a given  reference  frame  on  the  spacecraft.  In  the  usual  situation, 
a control  torque  and/or  force  are  introduced  at  the  boom-root.  Consequently, 
any  control  action  on  the  antenna-feed  must  be  transmitted  through  the  flexible 
boom  which  is  an  infinite  dimensional  system.  This  greatly  complicates  the  de- 
sign of  the  feed-attitude  control  system.  Here,  we  propose  to  mechanically  de- 
couple the  antenna-feed  from  the  boom  so  that  the  feed-attitude  control  system 
can  be  designed  without  taking  the  boom  dynamics  into  consideration. 

To  illustrate  the  basic  idea,  we  consider  the  special  case  where  the  boom 
vibrations  are  strictly  torsional  in  nature.  The  antenna-feed  is  represented 
by  a rigid  disk  (with  mass  polar  moment  of  inertia  J ) attached  to  the  boom-tip 
as  shown  in  Fig. 5.  Let  P be  a point  on  the  disk  whose  angular  position  with 
respect  to  the  fixed  y-axis  is  denoted  by  0 . First,  let  the  control  torque  T 
be  applied  at  x=0,  and  it  is  required  to  choose  T such  that  0 (t)  is  as  close 
to  a specified  reference  angle  0^  (say  6j^-0)  possible  at  al?  times.  Here, we 
observe  that  any  control  action  T =T  (t)  will  excite  boom  vibrations.  Hence, 

T must  be  manipulated  in  such  a way  tfiat  the  desired  effective  control  torque 
appears  at  the  boom-tip  for  controlling  the  feed  motion.  Moreover,  any  tor- 
sional motion  of  the  boom  due  to  external  disturbances  will  affect  9 . These 
undesirable  features  greatly  complicate  the  design  of  the  feed-attitude  control 
system.  Here,  we  note  that  since  the  relative  angle  9j^-9  is  to  be  controlled, 
the  foregoing  undesirable  features  can  be  bypassed  by  introducing  a rotary  actu- 
ator (for  example,  an  electric  servomotor  with  stator  moment  of  inertia  J ) at 
the  boom-tip  to  decouple  J from  the  boom  (see  Fig. 6).  In  absence  of  actuator 
friction  and  an  actuating  signal  to  the  actuator,  J rotates  freely.  Thus,  the 
angular-position  control  for  J can  be  achieved  by  applying  an  appropriate  actu- 
ating signal  depending  only  on  (0„-0  ),  Independent  of  the  boom  motion.  Note 

R o 
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that  a control  torque  applied  to  J generated  by  the  actuator  is  always  accompa- 
nied by  a reaction  torque  on  the  boom.  However,  in  absence  of  actuator  fric- 
tion, there  is  no  feedback  from  the  boom  motion  to  J . Consequently,  the  boom 
motion  has  no  effect  on  6 . In  real  physical  situations,  friction  coupling 
between  and  always  exists.  Its  effect  on  the  overall  system  is  studied 
using  the  following  simplified  mathematical  model; 

(a)  Torsional  vibrations  of  boom: 


I,  = (GJ9-  ) - 6 6. 

b Itt  lx  X It 


(15) 


with  boundary  conditions: 


3^(t,0)  = 0,  (GJ0^^) 


= -J 


1 dt^®lt'L 


) + c(0 


It'L 


) - T , 


(16) 


where  0^  is  the  torsional  angle  of  the  boom  with  respect  to  the  fixed  y-axls;I 
and  GJ  denote  the  mass  moment  of  inertia  and  torsional  rigidity  per  unit  length 
of  the  boom  respectively;  6 is  a positive  damping  coefficient;  c is  the  actuator 
stator-rotor  friction  coupling  coefficient,  and  T is  the  control  torque.  As  in 
Section  II,  the  lettered  subscripts  denote  partlaS  differentiation  and  0 =d0  /dt. 


(b)  Antenna-feed  (J  ) motion: 
— — — — o — ■ — — 


J 0 = -c(9  - 


o o 


lt‘L 


) + T 


(17) 


For  simplicity,  we  assume  that  the  reference  angle  0j^=O  and  the  actuator 
action  is  Instantaneous.  Thus,  we  can  introduce  the  usual  proportional-plus- 
rate  feedback  control  for  positioning  J : 


T = - (a0  + 30  ) , 

C 0 0 


(18) 


where  a and  3 are  given  real  positive  constants.  Thus,  the  complete  mathe- 

matical model  is  given  by  (15)- (18)  which  can  be  reformulated  as  a linear  evolu- 
tion equation  defined  on  a suitable  infinite  dimensional  state  space.  It  is  of 
interest  to  establish  nontrivial  sufficient  conditions  for  asymptotic  stability 
of  equilibrium  in  the  sense  of  Lyapunov  with  respect  to  the  "energy  norm"  ||w|| 
defined  by 


w 


€ 


(r  0 


b'  It' 


+ GJ  0^^  ^)  dx  + J,  0 


I'^lt'L' 


J 0^ 
o o 


+ a0' 


(19) 


Unfortunately,  this  task  is  not  straightforward.  However,  it  can  be  verified 
[7]  that  the  determination  of  the  eigenvalues  or  poles  of  the  complete  system  is 
reducible  to  solving  the  following  nonlinear  eigenvalue  problem:  Find  all  non- 
trivial pairs  (0,X)GH^(J^)  ©C  satisfying: 
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(20) 


(GJ0  ) - \(6  + Xlje  = 0, 

XX  b 

0(0)  = 0,  (GJ0  )|  = -A^q(A)0(L),  (21) 

X JL 

o 

where  H (fi)  denotes  the  complex  Sobolev  space  of  order  2 on  = ]0,L[;  C denotes 
the  field  of  complex  numbers  and 

J + [J, (3+c)  + cJ  ]A  + J,a 

q(X)  . i 5 . (22) 

J A^  + (3+c)A  + ot 
o 

For  this  problem,  we  can  deduce  the  following  result  [7], [8]:  All  unstable  elgen- 

values  A = A„  + jA^  are  complex  and  they  lie  inside  the  half-disk: 

R i ■ ' 

{A  GC:  A^  + A^  < ct/J  , A > 0}. 

K i o K 

This  result  gives  an  upper  bound  for  the  growth  rates  and  natural  frequencies  of 
the  unstable  modes.  Note  that  this  bound  is  independent  of  the  boom  parameters. 

Another  aspect  having  practical  importance  is  the  determination  of  sensiti- 
vity of  the  eigenvalues  with  respect  to  the  friction  coefficients  c and  6.  This 
can  be  studied  by  a perturbation  analysis.  Fig. 7 shows  the  loci  of  eigenvalues 
with  variable  parameters  6 and  c for  a 10-meter  uniform  boom  with  = 0.64x10 
kg.m. , GJ^=  25  kg.m^/sec?  , and  angular  position-control  system  parameters:  J = 
6.4  X 10  kg.m?  , J = 1.2x10  kg.m?  ,a  = 0.474  kg.m?  /sec?,  and  3 = 0.0754 
kg.m?  /sec.  It  can  be  seen  from  Fig. 7 that  the  control  system  poles  for  the 
indicated  variations  of  parameter  values  for  6 and  c are  clustered  around  the 
poles  A = -3.14  + 5.43J  corresponding  to  6 = 0 and  c = 0.  This  shows  that  the 
control  system  poles  are  insensitive  to  variations  in  the  friction  coefficients. 

The  effectiveness  of  the  decoupling  is  revealed  in  the  computer  simulation 
results  given  in  Figs. 8 and  9.  Fig- 8 shows  that  for  the  indicated  initial  con- 
ditions, the  ratio  of  peak  magnitudes  of  the  control  system  positional  angle  0 
and  the  boom-tip  deflection  is  approximately  0.035,  which  corresponds  to  a 
significant  reduction  in  vibration  amplitude. 

To  verify  the  validity  of  the  proposed  decoupling  approach,  a small  scale 
model  of  the  foregoing  system  is  constructed  (see  Fig. 10).  The  boom  is  simu- 
lated by  a thin  steel  wire.  The  effectiveness  of  the  decoupling  is  clearly 
shown  in  Fig. 11.  Here,  the  ratio  of  the  peak  magnitudes  of  0 and  the  boom-tip 
deflection  is  approximately  0.02.  Fig. 12  shows  the  boom  vibrations  induced  by 
the  positional  control  system  motion  for  different  boom  stiffnesses.  From  the 
experiments,  it  is  found  that  the  friction  torques  in  the  servomotor  and  the 
sensing  potentiometers  are  predominately  static  in  nature.  However,  their  pre- 
sence does  not  alter  the  effectiveness  of  decoupling.  A more  detailed  descrip- 
tion of  the  sensitivity  analysis  and  experimental  results  are  given  in  [9]. 

The  foregoing  discussions  have  been  limited  to  the  simplest  case  with  tor- 
sional motion  only.  In  an  actual  antenna  system,  the  flexible  boom  could  under- 
go both  torsional  and  bending  vibrations.  It  is  required  to  keep  a given  point 
on  the  antenna-feed  aligned  with  a specified  point  on  a reference  plane  at  the 
boom-root.  Here,  we  may  mechanically  decouple  the  boom  motion  from  the  feed 
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positioning  control  system  by  introducing  a composite  rotary  and  linear  actuator 
as  shown  in  Fig. 13.  Thus,  the  antenna-feed  may  be  aligned  by  a combination  of 
linear  and  rotary  motions  of  the  actuators.  Analytical  studies  and  computer 
simulation  for  this  system  are  described  in  reference  [8]. 

From  the  analytical  studies,  computer  simulation,  and  experimental  studies, 
the  introduction  of  the  proposed  mechanical  decoupling  appears  to  be  a simple 
and  effective  approach  to  the  design  of  implementable  antenna-feed  attitude  con- 
trol systems  for  large  spaceborne  antennas.  This  approach  avoids  the  consider- 
ation of  an  infinite  dimensional  model  in  control  system  design.  To  maximize 
the  effectiveness  of  decoupling,  the  actuator  friction  should  be  as  low  as  pos- 
sible, and  to  avoid  instability  problems,  the  boom  must  have  sufficient  stiff- 
ness and  structural  damping. 
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ANTENNA-FEED 

PLATFORM 


Figure  1.  Typical  configuration  of  a spaceborne  antenna 
system  with  antenna-feed  rigidly  attached  to 
the  tip  of  a flexible  boom. 
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Figure  2.  Spatial  domain  of  the  dish  reflector. 


Figure  3.  Sketch  of  dish  reflector  with  a movable 
damper  (The  damper  may  also  be  placed  on 
the  backside  of  the  reflector  for  reducing 
its  effect  on  the  antenna  characteristics) 


ENERGY  (MODfiL) 


Figure  4a 

Figure  4.  Energy  decay  for  various  initial  kinetic 
energy  distributions  and  different  forms 
of  movable  dampers. 
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ENERGY  (MODAL) 


Figure  4b 
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ENERGY 


Figure  5.  Attitude  control  of  antenna-feed  which  is 
rigidly  attached  to  the  tip  of  a flexible 
boom. 


Figure  6.  Proposed  mechanical  decoupling  of  the  antenna- 
feed  from  the  flexible  boom  with  torsional 
motion  only. 
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Figure  7.  Loci  of  eigenvalues  with  variable  parameters  6 
and  c for  a 10-meter  uniform  boom. 

X—  6=0.  ,c=0.; 

A — 6 = 0.  , c = 0.005; 

• — 6 = 0.00625,  c = 0.005; 

— 6 = 0.0125,  c = 0.005; 

■ — 6 = 0.025,  c = 0.005. 

Other  parameter  values  are  given  in  text. 
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Figure  8.  Angular  displacements  of  boom-tip  [9  (t,L)]  and  J 
[0  (t) ] due  to  Initial  torsional  deflection  of  the 
bo8m  [0.(0, x)  = O.OOlx^  rad.,  0^  (0,x)  SO  rad. /sec 
0^x<10m.]  with  J Initially  at  rest.  I,  = 0.64x 
10"^  kg.m.  J J = 6?4xl0-‘*  kg.m.^  ; J = 1.2  x lo"^ 
kg.m?  ; GJ  = 25  kg.m.^  /sec?  ; 6 = 0.025  kg. m. /sec. 
01  = 0.474,  3=  0.0754  and  c = 0.005  kg. m.^ /sec. 


Figure  9.  Angular  displacements  of  the  boom-tip  [0^(t,L)] 
and  J [6  (t)  ] due  to  initial  angular  displace- 
ment of  [6  (0)  = 0.05  rad.,  0 (0)  = 0.  rad./ 
sec.]  witH  bo8m  initially  at  res?.  The  system 
parameter  values  are  given  in  Figure  8. 


THIN  STEEL  WIRE 


ANGULAR  POSITION 
SENSOR 


ROTARY  ACTUATOR 


DISK 
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Figure  10.  Experimental  setup  for  a small  scale  model  of 
the  proposed  antenna-feed  angular-position 
control  system  with  decoupler. 
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Figure  11.  Angular  displacements  of  J (upper  trace;  scale: 

2°/div.)  and  boom-tip  (lower  trace;  scale:  5°/div.) 
due  to  initial  boom  displacement ,i  with  angular  posi- 
tion-control system  initially  at  rest.  Time  scale: 
0.1  sec./div.  Equivalent  spring  constant  for  boom: 
0.1362  kg.m?/sec.,  = 2.41  xl0~  kg.m?  and  J = 
1.32X10"  kg.m?  ° 


(a)  (b) 

Figure  12.  Angular  displacements  of  J (upper  traces)  and  boom- 

tip  (lower  traces)  due  to  initial  displacement  of  J , 

with  boom  initially  at  rest.  (a)  Equivalent  spring 

constant  for  boom:  0.1362  kg.m?/sec^.;  scale:  upper 

trace  l°/dlv. , lower  trace  5°/div. ; (b)  Equivalent 

spring  constant  for  boom:  4.52  x 10“  kg.m?  /sec?  ; 

scale:  upper  trace  5°/dlv. , lower  trace  50°/div. 

The  values  for  and  J are  identical  to  those  in 
1 o 
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ANTENNA-FEED 
PLATFORM 


Figure  13.  Proposed  mechanical  decoupling  of  the  antenna-feed 
from  the  flexible  boom  with  torsional  and  bending 
motions . 
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A CLOSED-LOOP  PRINCIPAL  COMPONENT 
ANALYSIS  OF  A TETRAHEDRAL  TRUSS* 

Edmond  A.  Jonckheere 

Department  of  Electrical  Engineering  — Systems 
University  of  Southern  California 
Los  Angeles,  CA  90007 


ABSTRACT 

In  this  paper,  a tetrahedral  truss,  representative  of  a fairly  broad  class 
of  large  space  structures,  is  considered.  A new  method,  measuring  the 
dynjmical  importance  of  each  elastic  mode  when  the  structure  is  under  feedback 
control,  is  presented. 


I.  INTRODUCTION 

The  Tetrahedral  Truss  devised  by  Charles  Stark  Draper  Labs  (Ref.  21 ) has 
the  simple  geometric  structure  of  a tetrahedron,  and,  despite  its  simplicity, 
it  models  the  feeding  tower  of  a large  cassegrain  antenna  (Eig.  1.1 ).  It  is 
considered  as  the  simplest  nonplanar  structure  capable  of  representing  a Large 
Space  Structure . The  nodes  of  the  tetrahedron  are  assumed  to  be  torque  free, 
so  that  the  connecting  rods  are  only  undergoing  tract  ion/ ccmpr  ess  ion  efforts. 
The  three  nodes  of  the  tetrahedral  basis  are  connected  to  the  (inertially 
stabilized)  antenna  dish  via  three  bipeds,  containing  a total  of  six 
CO- located  sensors/ actuators.  The  tetrahedral  apex  represents  the  position  of 
the  seconiary  reflector  whose  motion,  because  of  the  nature  of  the 
electromagnetic  problem,  must  be  controlled  extremely  accurately.  Hence  the 
problem  is  — control  the  motion  of  the  tetrahedral  apex,  by  means  of 
actuators  located~in  the  bipeds,  using  sensor  information  from  the  bipeds. 

The  design  of  the  control  system  for  such  a,  distributed  parameter, 
vibrating  mechanical  structure  faces  an  old,  yet  not  conpletely  resolved, 
problem  — how  many  vibration  modes  shoxiid  the  control  system  take  into 
account  to  guarantee  (robust)  stability? 


Using  Moore's  "open-loop  principal  component  analysis"  (Ref.  7),  the  author 
and  Silverman  (Refs.  15,16)  have  shown  that  the  open- loop  importance  (more 
precisely,  the  singular  value  ®’2i-1*'^2i^  vibrating  mode  decreases 

with  increasing  eigen  frequency,  am  that  the  spectrum  of  the  singular  values 
becomes  infinitely  spread  as  the  relative  damping  goes  to  zero. 


A delicate  issue  is  whether  or  not  the  above  conclusion  can  be  extemed  to 


*This  research  was  supported  by  the  USC/TRW  Ikiiversity/lndustry  NSP  Grant 
BC!S-81 1 2327. 
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the  closed- loop  case.  Practical  experience  ("spillover"  problem.  Ref.  17)  has 
shown  that  this  is  not  always  the  case. 

To  tackle  the  closed-loop  problem,  Skelton  (Ref.  9)  developed  the  "cost 
analysis".  More  recently,  the  author  and  Silverman  (Refs.  11,12)  have 
developed  a "closed-loop  principal  component  analjrsis"  which  is  aimed  at 
measuring  how  much  each  mode  is  important  vriien  the  system  is  closed-up  by  a 
LQl  feedback  loop.  This  technique  assigns  a measure,  or  "closed-loop  singular 
value",  to  each  mode.  The  main  purpose  of  this  paper  is  to  look  at  the 
tetrahedral  truss  control  problem  in  the  light  of  this  "closed-loop  principal 
component  analysis."  Surprisingly  enough,  the  conclusions  of  the  closed-loop 
principal  component  analysis  do  not  quite  well  corroborate  the  results  of  the 
open- loop  analysis.  While  the  spectrum  of  the  open- loop  singular  values  is 
infinitely  spread,  the  spectrum  of  the  closed-loop  singular  values  is 
surprisingly  concentrated.  In  more  intuitive  but  less  precise  terms,  this 
means  that,  while  the  modes  appear  disparately  important  open-loopwise,  they 
are  about  equally  important  closed-loo pwise . This  discrepancy  between  the 

open- loop  and  the  closed- loop  analyses  can  be  interpreted  as  a tentative 
theoretical  explanation  of  the  spillover  problem.  Also,  it  turns  out  that  the 
closed- loop  importance  of  each  mode  depends  on  the  robustness  requiranents  on 
the  LQGr  feedback  loop.  For  example,  some  seemingly  unimportant  modes  can 
become  very  important  under  stringent  feedback  perfomance  requiranents  (Refs. 
13,14)-  This  paper  basically  addresses  such  issues. 


II.  THE  TETRAHEDRAL  TRUSS 

The  motion  of  the  tetrahedral  truss  is  most  conveniently  represented  by  its 
modal  state  equations: 

dx(t)/dt  - Ax(t)  + Bu(t)  (ll-l) 

yj,(t)  = Cjjc(t) 

In  the  above,  x is  the  modal  state  vector;  position  of  the  ith 

mode,  while  is  the  rate  of  the  same  mode.  A is  a block-diagonal  matrix 
with  blocks  of  the  form 


A s. 
, i 


is  the  ith  modal  eigenfrequency;  the  eigenfrequencies  are  classified  in 
increasing  order.  $ is  the  relative  damping,  u is  the  vector  of  the  six 
axial  forces  acting  along  the  legs  of  the  bipeds;  the  rows  of  B with  odd 
indices  are  vanishing,  while  the  rows  with  even  indices  contain  the 
coefficients  of  influence  of  the  actuators  on  the  corresponding  mode.  y^.  is 
the  six-dimensional  output  vector  of  the  rate  sensors  located  in  the  bipeds. 
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and  Cj.=B ' . 


Some  more  outputs  can  be  defined:  The  six-dimensional  output  y„  of  the 
position  sensors  located  in  the  bipeds,  and  the  two-dimensional  output 
giving  displacement  of  the  tetrahedral  apex: 

yp(t)  = Cpx(t) 

Define  tiie  transfer  functions: 

Gj.(s) 

Gp(s)  Cp(sI-A)-^B  ^ (l/s)Gj,(s) 

cys)  :=,=  Cjsl^)-1B 

Regardless  of  mode  frequencies  and  shapes,  IBS  transfer  functions  enjoy 
sane  ranarkable  properties  (Refs.  18,19)-  ^ a fairly  general  result,  the 

(square)  tr;ansfer  matrix  Gj,(s)  ftom  the  (co- located)  force  actuators  to  the 
rate  sensors  is  positive  real  (or  dissipative).  Further,  the  same  transfer 
function  is  lossless  if  the  relative  damping  5 vanishes.  Finally,  observe 
that  Gj,(s)  is  reciprocal,  i.e.,  Gj,(s)=G^(s) . 

Tl:ie  finite  element  method  (BASTRAN)  was  used  by  Draper  labs  to  derive  the 
second  order,  modal  equations  of  the  truss  (Refs.  19-21).  These  equations 
were  further  put  into  the  first  order,  state  equation  form  (II. 1)  by  TRW  (Ref. 
20).  The  modal  state  equations  (II. 1)  contain  12  eigenmodes.  Both  "ncminal" 
and  "perturbed"  modal  data  are  available  to  try  out  and  evaluate  control 
desi^q)  techniques  on  the  truss.  The  nominal  data  is  available  to  the  design 
engineer,  while  the  perturbed  data  represents  a realistic  departure  from  the 
nominal  data.  In  the  nominal  and  perturbed  cases,  the  eigenfrequencies  are, 
respe;ctively. 


W = 1.3420108, 

W 2 1.6647234, 

W ^ 2.8907117, 
w 4 - 2.9574139, 
Wr  - 3-3981995, 
w g 4.2044821, 

w r,  - 4-6620682, 

w g = 4.7552602, 
w Q 8.5394174, 
w.X  - 9-2505638, 
w..  - 10.284775  , 

= 12.905111  , 


1.1706554  rad/sec 
1 .4667812 
2.96461 45 

3.5577479 

3.8483893 

5.1494174 
5.6759052 
5. 71 08055 
8.9396286 
10.303582 

10.923379 
13- 966664 
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III.  CIOSED-LOOP  PRINCIPAL  COMPONENT  ANALYSIS 


Consider  a minimal  state  space  representation 


dx/dt  = Ax  + Bu  (III.1) 

y - Cx 

of  the  plant  transfer  matrix 
G(s)  C(sI-A0~''b 

y is  an  output  available  for  feedback  purposes,  while  u is  the  available 
control  action.  . 

The  basic  philosophy  of  the  "closed-loop  principal  component  analysis"  is 
to  measure  how  much  each  "mode"  (more  generally,  each  state  component) 
participates  in  the  inherent  closed- loop  behavior  of  the  system.  Model 
reduction  and  reduced  coapensation  then  follow  by  deleting  the  unessential- 
modes.  There  are  many  ways,  depending  on  the  specifications,  of  closing  up  a 
system  by  a feedback  loop,  but  it  is  attractive  to  do  this  in  an  optimal  vra.y 
using  the  Linear-Quadratic-Gaussian  approach.  This  approach  is  indeed 
systematic  and  fairly  well  understood.  It  also  has  the  advantage  of  endowing 
the  design  with  fairly  general  robustness  properties.  More  interestingj.y,  the 
LQG  design  can  be  tuned  to  optimal,  sensitivity  properties  by  correct 
adjustment  of  the  the  quadratic  criterion  and  the  noise  covariances.  To 
formulate  the  problem  within  the  LQG  setup,  a disturbance  Gaussian  white  noise 
d(t)  is  added  at  the  inp.it  and  a measurement  Gaussian  white  noise  n(t)  is 
added  at  the  output,  as  depicted  in  Pig.  III.1.  To  tune  the  LQG  design  to 
optimal  robustness  properties,  it  is  convenient  to  introduce  the  quadratic 
criterion 


rQu'u+qQ.y'y  (III.  2) 

together  with  the  noise  statistics 

E(d(t)d'(T))  = qjj^l8(t-T)  (III. 2) 

E(d(t)n'  (t)  = 0 
E(n(t)n'(-r))  = rjjjl8(t>-'r) 

The  q's  and  the  r's  are  (positive)  parameters  which  must  be  tuned  so  as  to 
reach  satisfactory  robustness  properties.  To  understand  how  this  can  be  done, 
let  K(s)  be  a causal,  stabilizing,  strictly  proper  compensator.  A crucial 
resul.t,  apparently  due  to  Youla  (Refs.  1,2),  connects  up  the  performance 
E(Go.y'y  + TqU'u)  of  the  compensator  K(s)  with  robustness  matrices  as  follows: 
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E(<3oy'y  = 

Trace  { 

(lorjn(I-(X)-l  GKK*G*(I-K*G*)-1 
+ qQqjj^(l-GK)-''GIG*(I-K*G*)-'' 

+ roqjj^(l-KG)-''KGG^*(l-G*K*)-'' 

+ rorjj^(l-KG)-%*(l-G*K^)-''  | ds  (III. 3) 

(K*(s)  :=  K'(-s),  that  is,  the  paraconjugate  transpose  of  K(s).)  The  matrices 

(i-gkHgk,  (i-gk)“'’g,  (i-kg)-''kg,  (i-kg)“''k 

are  well  known  to  be  related  to  the  robustness  properties  of  the  loop;  the 
anaZLer  they  are,,  the  better  the  robustness  properties.  Intuitively  speaking, 
BquaiLity  (III. 3)  says  that  minimizing  the  expected  value  E(q^y'y  + r^u'u)  is 
equivalent  to  minimizing  the  integral  over  all  frequencies  of  "sizes"  of 
"robustness"  matrices,  Dtore  intuitively,  this  says  that  minimizing  a quadratic 
criterion  boosts  robustness.  In  the  weighted  average  of  the  right  side  of 
Eqmtion  (III. 3),  the  matrices  (I-GK)“'GK  and  (l-KG)"^KG  are  of  particular 
importance;  they  are  indeed  related  to  the  stability  margin  of  the  feedback 
system  (Ref  . 3);  the  smaller  these  matrices,  the  more  the  feedback  syston  can 
accoranodate  variations  of  GX  and  KG,  respectively,  before  the  loop  becomes 
unstable.  In  closed-loop  model  reduction,  the  reduction  error  is  considered  as 
plant  variation  and  is  expected  to  be  taken  care  of  by  the  stability  margin 
property  of  the  loop.  Hence,  for  this  to  be  the  case,  it  is  imperative  that 
the  I/3G  problem  underlying  model  reduction  boosts  the  stability  margin.  This, 
can  be  accomplished  by  assigning  the  following  values  to  the  parameters: 

and  by  taking  € arbitrarily  small.  Yet  this  technique  can  be  further  refined 
by  considering  colored  noises  d and  n and  also  by  injecting  them  at  other 
nodes  along  the  loop  (Refs.  1,2,3);  we  postpone  these  topics  to  a further 
paper . 


Invoking  the  Separation  Principle,  one  can  split  the  design  into  an  optimal 
filtering  and  an  optimal  control  problan.  It  is  well  known  (Refs.  4,5)  that 
these  optimization  problans  involve  in  a crucial  way  the  filtering  algebraic 
Riccati  equ^ition 

APm  + Pj[/^'  + qjnffl'  - Ejjp'CPjj/rj^  0 

and  the  control  algebraic  Riccati  equation 
A'Pq  + PqA  + q^C’C  - PoBB'Po/ro  - 0 
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The  optimal  LQG-  controller  consists  in  the  cascade  of  the  optimal  filter 
dw/dt  = Aw  + Bu  + r””*  PjjP' (z-Cw) 
and  the  optimal  control  gain 
u»-r"% ' PqW 

This  leads  to  the  following  state  space  representation  of  the  compensator 
K(s): 

dw/dt  ==  IW  + Gz 
u=Hw 


where 

F A - - P„0'r-'C 

0 V/% 

The  following  is  a fundamental  result  (Refs.  8,10-12): 

Theorem . Let  (A,B,C)  he  a minimal  realization  of  G(s).  Then  the  eigenvalues 
of  PqJ^  are  similarity  invariants.  Further,  these  eigenvalues  are  real  and 
strictfy  positive.  If  . . . denote  the  eigenvalues  of  PoPm 

decreasing  order,  then  there  exists  a "halancing"  transformation  ^ and  a 
"balanced"  state  space  realization  (A,B,^):=(^AT“  ,^B,CT~' ) in  which 


where 

M diag  | *"  ! 

The  physical  interpretation  of  the  above- defined  quantities  (the 
"closed- loop  singular  values")  should  be  clear.  Assume,  for  example,  that 
is  "anall";  then  the  balanced  state  component  Xfe  "easy"  to  filter  (^ 

"anall"  along  the  kth  direction)  and  "easy"  to  control  (^  "snail"  along  the 
kth  direction);  hence,  intuitively,  3^  is  an  unessential  state  component;  it 
is  not  significantly  involved  in  the  closed-loop  LQG  behavior  of  the  system, 
and  it  can  be  discarded.  Technically,  this  model  reduction  is  done  as  follows: 
Assume 

M ==  block  diag  { , M22  1 
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with  "much  larger  than"  M22-  Ifertition  both  the  plant  state  equations 


dx^/dt  = ^ Xl 

d^/dt  = 


+ A12S 

+ AoqXo 


+ B-jU 

+ ^u 


y = C^l  x^  + 022^ 


and  the  controller  state  equations 


dw-j/dt  s:  p ^ ^ w-j  + F .j  pw^  + &-j  z 

dv^/dt  F2I— 1 

u = w-|  + H2W2 


conformably  with  M.  The  obvious  reduced-order  model  is  (A-]-] , ,C^ ),  and  it  is 

easily  proved  that  its  optimal  LQG  controller  is  the  obvious  reduced-order 
compensator  (F-ii>  The  problem  of  the  stability  of  the  full  plant  G(s) 

closed  up  by  the  reduced  compensator  is  tackled  in  the  paper  of  the  author  and 
Silverman  (Ref.  12).  It  is  there  proved  that  stability  is  guaranteed  provided 
M22  is  "sufficiently  small". 


Besides  this  stability  result,  the  closed- loop  principal  conponent  analysis 
yields  other  ranarkable  results  (Ref.  12),  relevant  to  LSS: 

Theoren.  Let  (A,B,G)  be  a minimal  realization  of  G(s).  Take  q^fr"^  and 
qQ=rjj|Tl  ^en  G(s)is  a reciprocal,  lossless  transfer  matrix  if  and  only  if  mj^ 
= 1,  ks=1,2, ...  Further,  in  that  case,  the  balanced  state  space  realization  is 
unique  within  orthogonal  similarities  and  in  any  such  rea].ization  As=-A'  and 
C=B ' . 

A few  words  about  computations.  The  main  computational  burden  is  the 
solution  to  both  algebraic  Riccati  equations.  However,  there  is  a fast  and 
reliable  procedure  for  solving  the  algebraic  Riccati  equation:  the  technique 
of  leiub  (Ref.  6),  which  proceeds  via  the  quasi-upper  triangular  form  of  the 
Hamiltonian  matrix.  Once  P_  and  are  computed,  their  simultaneous 

diagonalization  proceeds  via  the  factorization  of  one  of  them,  say,  Pjji=LR'  > 
followed  by  the  eigenanalysis  of  L’P^L;  see  Ref.  8. 


IV.  RESULTS 

In  this  section,  the  tetrahedral  truss  is  analysed  in  the  light  of  the 
closed- loop  principal  component  analysis.  Several  plant  transfer  matrices, 
corresponding  to  different  conbinations  of  sensors  and  actuators,  are 
considered.  In  each  case,  the  modes  are  classified  by  order  of  closed- loop 
importance.  The  parameters  q and  r of  the  quadratic  criterion  and  the  noise 
statistics  are  varied  so  as  to  determine  how  the  classification  is  affected  by 
the  stability  margin  requironents . Tentative  conclusions  for  model  reduction 
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are  derived. 


IV. 1 . force  actuator/ rate  sensor  transfer  matrix;  nominal  data;  no 


The  6x6  transfer  matrix  from  the  (co- located)  force  actuators  to  the  rate 
sensors  is  lossless  and  reciprocal;  hence  all  of  the  |#'s  must  be  equal  to  1. 
Since  we  know  in  advance  the  result,  this  case  is  rather  a good  test  of  how 
well  the  numerical  software  performs.  The  software  has  given  the  right  values, 
up  to  a relative  error  of  10“'“^;  see  Table  IV.  1.  This  is  quite  satisfactory. 
Regarding  the  balancing  matrix  T,  which  transforms  the  modal  state  space 
representation  (A,B,C„)  into  the  balanced  state  space  representation  (A,B,^), 
it  is  easily  found  that  this  transformation  can  be  taken  as  block  diagonal 
with  blocks  of  the  form 


However,  the  balancing  transformation  is  uniquely  defined  up  to  orthogonal 
transformations,  and  the  software  does  not  pick  up  precisely  the  above 
transformation  (there  are  no  reasons  why  it  should). 

Thus,  the  closed-loop  principal  canponent  analysis  does  not  manage  to  make 
any  discrimination  between  essential  and  nonessential  modes,  when  the  undamped 
truss  is  closed  up  by  a feedback  loop  frcm  the  rate  sensors  to  the  force 
actuators.  Intuitively,  the  reason  is  that  the  rate  sensors  differentiate  the 
modal  displacanents;  this  is  roughly  equivalent  to  multiplying  the  modal 
displacements  by  their  respective  eigenfrequencies,  thereby  boosting  the 
contribution  of  the  hi^  frequency  modes  and  leveling  up  everything. 

IV.  2.  force  actuator/ rate  sensor  transfer  matrix;  nominal  data;  nonzero 


Since  a ISS  exhibits  a structural  damping,  which  is  poorly  known,  it  is 
important  to  determine  how  the  above  analysis  is  affected  by  the  damping  and 
to  make  sure  that  the  results  are  consistent  over  a realistic  range  of 
relative  damping  values. 

The  results  of  the  closed- loop  principal  component  anaI,yBis  for  Js=0.01  and 
0.001  are  given  in  Table  IV.  2.  The  introduction  of  this  damping  clearly 
spreads  the  spectrxmi  of  the  j#'s.  A distinction  between  essential  and 
nonessential  state  canponents  is  now  possible.  Observe  that  the  i#'s  appear  by 
pairs  of  roughly  equal  values  (this  was  also  the  case  for  the  open-loop 
analysis  (Ref.  15));  it  is  therefore  tempting  to  associate  each  such  pair  with 
a particrlar  mode.  This  intuitive  conclusion  is  validated  by  the  "pattern"  of 
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the  balancing  transformation  shown  in  Table  IV. 2.  The  balancing 
transformation  is  very  "sparse";  in  other  words,  the  balancing  transformation 
merely  relaljels  the  modal  coordinates.  Observe  that  the  pattern  of  the 
balancing  transformation  is  not  significantly  affected  by  the  relative 
damping.  In  other  words,  the  closed-- loop  princip^  ccanponent  analysis  does  not 
depend  too  much  on  the  (poorly  known)  structural  damping.  Surprisingly  enougji, 
observe  that  the  modes  are  classified  by  decreasing  order  of  importance  as 
follows : 

6,7,8,5,4,3,9,10,2,11,1,12 

IV.  3.  force  actuator/ rate  sensor  transfer  matrix;  nominal  data;  nonzero 
damping;  r^=^!,  q„,^1 , r„pt 

We  now  examine  how  the  above  classification  is  affected  when  the  quadratic 

criterion  and  the  noise  statistics  are  chosen  so  as  to  boost  the  inverse 

•“1 

return  difference  I-(KG-„)  , the  size  of  this  matrix  being  somehow 

proportional  to  the  stability  margin.  The  results  are  shown  in  Table  IV. 3* 
Sanehow  thejr  are  not  quite  different  fran  the  preceding  case.  The  relative 
spread  of  the  M's  is  about  the  same.  The  balancing  transformation  is  still 
sparse,  thereby  relabd.ing  the  modal  coordinate.  The  whole  analysis  is  not 
that  much  ai’fected  by  the  jHrameter  €.  In  all  of  these  cases,  the  closed- loop 
principal  exponent  analysis  consistently  classifies  the  modes  by  order  of 
decreasing  importance  as  follows: 

6,7,8,5,4,3,9,10,2,11,1,12 
that  is,  the  same  as  for  the  preceding  case. 


IV.  4.  force  actuator/ position  sensor  transfer  matrix;  naninal  data;  no 
damping;  q^^r^^q^^r^^l 


We  now  look  at  the  case  where  the  controller  uti].izes  the  output  y_  of  the 
position  sensors  located  in  the  bipod  legs,  that  is,  in  the  quadi’atic 
criterion  (III. 2)  we  take  y=y-p.  We  are  not  yet  quite  concerned  about  the 
stsbiility  margin,  that  is,  we^ normalize  the  performance  criterion  and  the 
noise  statistics  as  . The  results  of  the  closed- loop  principai. 

component  analysis  of  are  given  in  Table  IV.  4*  All  of  the  M's  are  very 

closer  to  one,  within  lO'^.  The  spectrum  of  the  closed- loop  singular  values  of 
the  transfer  function  fron  the  force  actuators  to  the  dispLacanent  sensors  is 
surprisingly  concentrated,  while  the  spectrum  of  the  open- loop  singular  values 
is  much  broiader  (Refs.  15,16).  It  is  thus  difficult  to  classify  the  modes  by 
order  of  importance.  The  balancing  transformation  is  not  quite  sparse,  simply 
because  it  is  hard  to  make  a distinction  between  the  modes. 

IV. 5"  force  actuator/ position  sensor  transfer  matrix;  nominal  data; 
nonztjro  damping;  qr'-rn=qr„-r,„=1 

We  now  look  at  how  the  closed- loop  principal  component  analysis  of  Gp(s)  is 
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affected  by  the  damping.  The  results  are  summarized  in  Table  IV. 5*  Clearly, 
the  introduction  of  a relative  damping  somehow  spreads  the  |#'s,  and  a 
classification  is  now  possible.  As  a result,  the  balancing  transformation  is 
"sinrse";  however,  the  balancing  transformation  does  not  exactly  relabel  the 
modal  coordinates.  Ibr  example,  for  SmD.01,  the  balanced  coordinates 
7’  ^ depend  on  the  modal  coordinates  of  both  eigertnodes  4 and  7; 

somehow,  the  eigenmodes  4 and  7 are  intercoupled  in  the  balanced  coordinates 
and  can  hardly  be  dicriminated  importancewise . Also,  the  balanced  coordinates 
Xq,  X|q,  Xii  > eigenmodes  3 and  8.  Thus  the 

classification  of  the  eigenmodes  in  decreasing  order  of  importance  is  as 
follows : 


6, 5, (4, 7), (3, 8), 2,1, 9, 10,11, 12 

The  parentheses  indicate  that  the  modes  are  intercoupled  in  the  balanced 
coordinates  and  cannot  be  discriminated.  This  classification  is  not  the  same 
as  for  the  case  of  rate  sensor  output  feedback.  Further,  this  classification 
is  slightly  affected  by  the  damping,  thoigh  an  "invariant"  feature  is  that 
eigenmodes  9,  10,  11,  and  12  are  always  classified  as  nonimportant . 


IV.  6.  force  actuaty/positon  sensor  transfer  matrix;  nominal  data;  nonzero 
damping;  q^^l , r^^b,  r^,^1 

The  quadratic  criterion  and  the  noise  statistics  are  now  chosen  so  as  to 
boost  the  stability  margin,  that  is,  the  "size"  of  I-(Cj^)“  , for  the  position 
sensor  output  feedback  case.  The  results  are  summarize  in  Table  IV. 6.  The 
|#'s  are  about  as  well  spread  as  for  the  preceding  case.  Further,  the 
balancing  transformation  is  about  as  "sparse"  as  for  the  preceding  case,  which 
means  that  the  balancing  does  not  exactly  reoi’der  the  modal  coordinates.  For 
€=0.1,  the  classification  is  as  follov;s: 


(5, 6), (4, 7), (3, 8), 2,1, 9, 10,11, 12 


Bor  €=0.01,  the  classification  becomes 

(3, 4, 5, 6,7, 8), 2, 1,9, 10, 11, 12 

Thus , the  classification  depends  on  the  stability  margin  requirement . As  the 
stability  margin  increases,  the  coupling  of  sane  modes,  for  example,  6,  7,  and 
8,  increases;  this  is  because  these  modes  have  very  close  eigenfrequencies . As 
the  stability  margin  req-uironent  varies,  the  only  "invariant  feature"  is  that 
eigenmodes  #9,  10,  11,  and  12  are  consistently  decoupled  from  the  others  and 
classified  as  nonimportant;  this  can  be  justified  by  the  big  gap  between  Wq 
and  Wg. 
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IV. 7-  force  actuator/ position  sensor  transfer  matrix;  nominal  data; 
ronzeiro  damping;  q^=€, 

¥e  still  look  at  the  case  of  a displacement  feedback,  but  we  are  now 
concerned  about  the  inverse-return  difference  matrix  I-(K(r„)“  . The  "size"  of 
this  invers«^return  difference  matrix  is  boosted  if  the  ^qiiadratic  criterion 
and  the  noise  covariances  are  taken  as  above.  The  results  are  shown  in  Table 
IV. 7.  Ibr  €s0.1,  the  classification,  by  decreasing  order  of  importance,  is  the 
following : 


(5, 6), (4,7), (3, 8), 2,1, 9, 10, 11, 12 

Under  the  more  stringent  stability  margin  requirement  €*0.01,  this 
classification  becomes 

(3, 4, 5, 6,7, 8), 2,1, 9, 10,11, 12 

Thus  the  classification  is  the  same  as  for  the  preceding  case.  Ibrther, 
competing  Tables  IV. 6 and  IV.7  shows  that  this  case  is  very  close  to  the 
preceding.  This  observation  is  reassuring.  Indeed,  in  the  multivariable 
feedback  loop,  the  stability  of  both  transfer  matrices  (I-G^)”'  and  (I-KGp)"' 
is  involved,  and  their  stability  margins  are  given  by  the  "sizes"  of'^the 
inverse-return  difference  matrices  I-(G^)“^  and  I-(KG_)“' , respectively.  It 
is  reassuring  to  observe  that  the  boosting  of  both  ^stability  margins  is 
nonconflicting  in  this  approach. 


IV. 8.,  force  actuator/ position  sensor  transfer  matrix;  perturbed  data; 
nonauo  damping;  r^^l , q^'l , r„,^t 

Needless  to  say,  the  modal  data  of  a ISS  is  highly  uncertain.  It  is 
therefore  usseful  to  determine  the  extent  to  which  the  above  analysis,  say  case 
# IV. 7,  is  affected  by  a variation  of  the  mode  eigenfrequencies  and  shapes. 
The  results  of  the  principal  component  analysis  in  the  perturbed  data  case  are 
summ;arized  in  Table  IV.  8.  Observe  that  the  results  are  close  to  those  of  case 
study#  IV. 8.  If  €=0.1,  the  classification  is  as  follows; 

6, 5,(4, 7,8),3,2,1, 10,11, 9,12 

Now,  if  the  stability  margin  requirement  is  more  stringent,  i.e.,  €=0.01,  the 
classification  becomes : 


(4,5,6,7,8,), 3,2,1, 10,11, 9,12 

Thus  increaising  the  stability  margin  requirement  produces  more  intercoupling 
between  sane  modes , that  is , the  same  tendency  as  for  the  naninal  data  case. 
It  seems,  therefore,  that  the  closed- loop  principal  component  analysis  picks 
up  the  "fine"  structure  of  the  system,  and  does  not  rely  too  much  on  mode 
eigenfrequencies  and  shapes.  This  is  reassuring. 


173 


IV. 9-  force  actuator/position  sensor;  nominal  data;  nonzero  damping; 

In  all  of  the  previoias  cases,  we  have  stabilized  the  whole  system,  without 
particular  attention  to  the  deviation  of  the  tetrahedral  apex.  Here  we 
introduce  a quadratic  criterion  -vdiich  explicitly  penalizes  the  deviation  y_  of 
the  apex.  ¥e  have  taken  the  other  parameters  are  taken  so  as  to  take 

care  of  the  stability  margin , i.e.,  qQ==€,  r^-l , Qjjf  1 , r^gie) , with  €^.1.  The 
results  are  shown  in  Table  IV. 9-  Observe  that  the  M's  are  fairly  well  spread, 
and  that  the  balancing  transfoimation  has  a much  more  diagonal  structure. 
This  means  that,  if  one  is  primarily  concerned  with  the  deflection  of  the 
apex,  the  modes  are  somehow  classified  by  decreasing  frequencies.  Rananber, 
however,  that  because  of  the  penalty  on  y^^  we  lose  on  the  stability  margin . 
There  thus  appears  to  be  a conflict  between  precise  control  of  the  apex  and 
good  stability  margin . 


V.  CONCLUSIONS 

We  have  presented  in  this  paper  a procedure  for  evaluating  the  "dynamical 
importance"  of  each  vibration  mode  in  a Large  Space  Structure  operating  under 
feedback  control.  The  main  conclusion  we  have  drawn  is  that  the  closed- loop 
importance  of  each  vibration  mode  depends  on  the  robustness  requiranents . 
A1.SO,  there  appears  to  be  a conflict  between  precise  control  of  a given  node 
of  the  structure  and  stability  margin,  these  two  requiranents  leading  to 
contradictory  classifications  of  the  modes  by  order  of  importance.  A refined 
version  of  this  analysis,  using  frequency-dependent  weightings  (Refs.  1-3)  to 
take  care  of  noise,  sensor,  and  actuator  bandwidths,  is  postponed  to  a further 
paper . 
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TB&HSPbfi  PONCTXUli  PBOB  FOhCi  ACTUATORS  TO  RATR  SRNSOiS 
HOniRAL  DATA;  RfLATXPE  DAfflNG  ZAtA  « .UOUOOOOUUOOUaaaOUORiOO 


HU  1 « . 100U000U00000U04^D^01 
HU  2 » • 100UOOOOOOUObUOX74D«01 
HU  3 « •10000000000000003tD*Ot 
HU  4 » . 100UOUOOOOOOOU0027D«01 
HU  5 « • lUOOOUOOUOOOOOU01bU«01 
HO  6 .lOOUOOOOOOOOOOOOUUD^OI 
MU  7 » .lOOOOOOaOOOOOuOOOHD^OI 
HU  a « . jo5uooouooooooooii3D^oi 
HU  9 s .100000000000000003D^01 
H0 10  e .1000u00u00000a0uu20«01 
HU11  * .ioouooouooooouaoo2u«oi 
HU12  » olOOOOOOOOOOOOOOOOIO^OI 
HD13  » o1OOOOOOUOUOUOOOOU10<»O1 
H0 14  * .10000000000000000104^01 
HOIS  « .1000000000000000000401 
HUli  * .1000050000000000000401 
HU  17  * .1000000000000000UOD401 
HU  16  « •040099^0999000999b0-»00 
H019  s .9999999999999999930400 
HU20  ■ .9999999999999999^0400 
HU21  > .9999999999999999890400 
HU22  » .9999999999999999820400 
HU23  « .9999999999999999700400 
HU24  > .9999999999999999660400 


(MLAfJCCA) 


0666  & 6 4 

6 66466  6 6 

6666  6 46666  6 

6666  6 6 664 

6666  66  6 466  666 

66  466646  4 

6 6 6 *6  *66  466  6 6 

666  4 66466  66 
66466666666  6 

666666664666  46666 
66  666*66  66*66 
6 6 666  *6  6 
666  6 *666666  66 
666  *6666  66 
•66  666  66*6  6 
666  6 6 66*66 
6 6 6 *66 


*6 

6 6 

6 46 

6 6 6 46 
6666  466 
6 * 6 6 


(HO  dal) 


FDttCTXOH  PBOH  K)OCS  ACTUATORS  TO  RATE  SRHSORS 
SOfilHAL  DATA;  ERLATXVB  OAHRIHG  ZBTA  « .100000000000000000D-02 


HU  1 » 
HO  2 * 
HU  3 » 
HU  4 « 


.9B0819643<l3a51‘lilS90«00 
.9B07639>I23374927S2D«00 
.9792723 10B9370  3b780«00 
0191«S936o8b69O*00 
39063997911 IS&OtOO 

9919367B25316D*0U 
74542769a2  2 7 32D»(l0 
75548 1229380870*00 
.9650025725055386480*00 
.9504855311624059750*00 
.950429884095  3388410*00 
.9405145193022852850*00 
•2!f5?S§267294 83  76  640*00 
.9095126976900447860*00 
.9094366058788398030*00 

.Aaottoaiiao  'jonacLf^iion^ aa 


- .82  54392  95813  003  5940  *00 

• :8l^8l8S3S8S^18lH88S:88 


6 4 

4 6 

6 

« 

6 

4 6 

6 


Pattern  of  balancing  transformation  X 
a blank  denotes  an  entry  less  than  0.1 
4 denotes  an  entry  between  0.1  and  1.0 
* denotes  an  entry  greater  than  1.0 


TBAMbPliM  fUMCTlUN  fiiUh  faiJuR  ACTUATUab  TU  HATL  ShHSURb 
NOHiNAL  DATA;  UALAT1V8  OAiiPXbG  6MTA  » « 10000UU0u0OUu0uUU0O~Ul 


HU  1 - •6^58642nB5U0712bbV<U«00 

HU  2 » .84bBb424b4b49l72bo0t0U 

nU  3 - «b249lJjBbVbJ21b44bb«U0 

nu  4 > .b24bb14mib03695mDt0U 

MU  b « •b12^3dd4o740392B22U«00 

HU  6 s .b11b73644l93714442U*U0 
HU  7 « o7y8^724J04477932bbD»00 

HO  b * .7977116508912424  VtO^UU 

HU  9 > •74910Ubb505Ub9B24b0^U0 

HU  10  ~ .74Bb97j1o2l9l397jb0^00 

null  = .7u570o22b34b3755b4U«U0 

MU12  = .7051396  74995  173724U>(iU 

HU  13  =■  .blJ6437U9511bJ229J0«00 

HU14  s .6l31960379b71045090*0u 
HU  15  = .55960UU6154/35b7b4b«00 

HU16  » •559lJ20b71728Jl114D«00 

H017  - •4293U05319u979e405U«UU 

HU  IB  » .42Bb4353c5727B29240«00 

HU19  = . 377b1 109U9b9922b^lU«UU 

HU20  > .3773506 199b65l43b50tUO 

HU21  - .24372u34U95bb2799bl>«00 

HU22  s .24345bl90167o252270«U0 
HU23  » .527292bb2o54299U^D-Ul 

HU24  » .5251572436036231790-01 


6 4 

6 * 6 


TBANSPAR  PUNCTIUN  PROM  FORCE  ACTUATORS  TO  BATA  SEN.50BS 
HOnlNAL  DATA;  RELATIVE  DAMPING  ZETA  » .1000000000000000000-01 

LQG  DESIGN  TONED  TO  OPTIHAL  STABILITY  HABGIN  I-IHV(KG);  EPSILON  - 


• ioouooooouoouoooooo«oo 


HU  1 » 
HU  2 - 
HU  3 » 
HU  4 « 
MU  5 
HU  6 s 


.2386625903699238620^00 
•23dbB2590362563999D*00 
.22955 1503854 3560450^00 
.22936  3653422  5282BdD-»  00 
.2241937949894509260400 
239339020146592070*00 
185076146294367590*00 
216189736970 3751680*00 
1995180166245711020*00 
1992944816095921350*00 
1842682713243013320*00 
1840480767228578310*00 
1559925770475137510*00 
1557961674596242130*00 
1413418222689312370*00 
1411421869577625910*00 
1096362626740541690*00 
1094490929410572150*00 
9778065906336741240-01 
9768329599368490130-01 
6744825263674409790-01 
67  339  46  24284  02  4304D-0 1 
1624973561349186^0-01 
616456482586294^0 


* & 

& 6 


177 


t6&  6£ 

g ggg 

g 

g ggg 

ggg 

g 

gg  gg 

*g 

g 

gggg  g 

ggg 

ggg 

gggg 

g 

gg  g 

gg 

gg 

gg  gg 

g 

4g 

J'UNCTXOBi  i'bOa  FObCfi  ACTUAXOBS  TU  POSITION  SBMSOmS 
NOIIXNAL  DATA;  BBLATiVS  OAdPlNG  » *000000000000a000000^00 

AU  1 « .101423926472700b7?i)«C)]  f 

HU  2 * «101M0420b679622U19D«Ul  & 

HU  3 « •1U13943V73S8jb9b11D*U1  S$ 

HU  4 » .t013940950a5  99027bD«01  ! 6t>& 

HU  5 s . 1013293915924474b90^0j  gg  gg 

HU  6 « •101j223UB021b15348Ut01  6 

HU  7 « •100096029C&9211b&9l)^01  ggg 

SO  8 « •100U9239373|79145SO«01  gg  g 

U 9 « •1000342486&5616451D^01  gg  gg 

HD10  « .10Q031915399929245D-»01  gggg 

H0 11  * .100U2U8b23242S2B40l>«01  I g g 

HU12  B . 100000046204 1004bb0^01 

H013  * •999999&379S92205410«00 

H014  * .9997914202720529!^D^U0  g g g 

HU15  « •99968094782748674Ed«00  gggg 

HUIb  » .999bS7b30400903S37D400  gg  gg 

HU17  » •99907b91S5l432930lO^00  gg  g 

HUIB  » •99904g63Q681216aiU0«00  ggg 

H019  * •986949487754 13bi870>00  gggg  g 

HU20  « •98b919452551bS3b02U^00  gg  gg 

HU21  « •9662507270762S84710>00  g ggg 

H022  « .9d6247?8691231bl920«Ug  ggg  gg 

fl02J  • •98615238042a494631D«(lO  g 4g 

R02g  • «9S59636483603042320«00 


gg  gg 

g 

•g 

gg  g 

gg 

gg 

ggg 

gg4g 

g 

gggg  6 

ggg 

gg  gg 

•g 

g 

g ggg 

ggg 

g 

ggg  gg 

g ggg 

g 

g *g 

ggg 

4g 

g 

TbANSfbb  PUNCTiON  PfiUH  POfiCE  ACTUATOBS  TO  POSITION  SKNSOBS 
NOHINAL  DATA;  BBLATIVB  0ABPIN6  Z£TA  » • IOUOOOOOOOOOOOOOOOD-01 


HU  1 s 
HU  2 « 
HU  3 ~ 
HU  4 s 
HO  5 ^ 
HU  6 s 
HU  7 s 


HU21  « 
H022  s 
HU23  - 


•b26649S7601262B9bS0^00 
. 508722088066 390844D40U 
. 50023 1555295006705U>00 
•484065309965 1551860«00 
•4b 4470725932 59 514bD«00 

• 45120957672901  bo 55D^OO 
•450402698529599948D«00 
. 4380088 2 1 9 93 0 J 7 1 360 t 00 
•422573649241 5785540>00 
•412726605442 1788680^00 
•41046696142084 59410^00 
•40 16037224 16 7431220^00 

• 29  3 17 1 1 20  375  68  6 7 960^00 
•26O328945045 37242b0«00 

• 18  840  09  26772  65  26 130 ♦OO 
•1844060674297293870^00 

• 1150788 106653954470^00 
•112742164652 7205810^00 

• 88  342  9 0 395  89  32  0 2 860-01 

• 66  566  80969995200980-01 
.4327616379852002^0-01 

• 424156671872  6766^0-01 


*6 

«g 

♦g 

«g 

•g  g 

g *8 

#g  g 

*g 

gg  4g 

•g  g 

g gg 

gg  g 


« 40679  4406  008  93  72 10D-02 


XBANSFHa  FUNCTION  FbOH  FOBCB  ACTUATOBS  TO  POSITION  SSNSOBS 
NUHUAL  DATA:  BEUTIVE  OAHPING  ZETA  » •1000000000000080000-01 

LQG  OSSIGI  TONED  TU  OPTIHAI  STABILITT  HABGIH  I-INV (GN);  EPSILON 


•10U000O0U000O0O0O0O400 


HU  1 « 
HO  2 « 


•1345006963775392750^00 
. I2785508b1726l4530o«00 

• 12  7085S  2 0 0 5 8 9 1 5 9 620«00 

• 12123218132055e  7 68D«00 
. 119189150560  374 /9bO^OU 

• 1159248047  125o  8 8 06l>«U0 
•1134960 12 1912938 50D>00 

• 110644592 1484877 IbO^OO 

• 1094722721108175710^00 
. 106b775779561104b90>00 
. lU46449189l90092b4D«00 

• 1023249566444379290^0U 
.7939416139160675990-01 

• 76864042795998  79100-01 
.54  2740326  02044  51150-01 

• ^6909549704  33  6j  220-01 

• 34854  4752  9 48  918  5 800-01 
•34082841212569 14010-01 
. 27  18426013  13 39  45  160-01 
•266078640 1310882400-01 

• 1358525963122892400-01 
•1331139564111034570-01 

• 13114  6 3 759  9086  7 9 860-02 

• 12859  9786708  630  4000-02 


•g 

gg 

gg 

gg 

gg  g 

gg  #g 

gg  g 

g gg 

gg  gg 

gg  gg 

!l 

gg  gg 


TBANSFEB  FUNCTION  FBOH  FOBCE  ACTUATOBS  TO  POSITION  SENSOBS 

HOHINAL  DATA:  BELATIVE  DAHPIBG  ZETA  « .1000000000000080000-01  ^ 

LQG  DESIGN  TONED  TO  OPTIHAL  STABILITY  HANGIN;  EPSILON  » .1000080000000000000-01 


•2756697910105721700-01 
.262213497076064  lOo 0-01 
6062950923650480-01 
6827  155048264330-01 
374711740328110370-01 
28749740256 1034750-01 
•2278029864948080350-01 
.2182640320507062290-01 
.2165876694079603740-01 
•2109871988106912940-01 
•2041308572847609720-01 
. 1964064217525674MD-01 
. 1o8796030793b29760D-01 
1571232990128763410-01 
1239978009916668520-01 
.1180898927187954690-01 
• 68  420  46  2 34  40  64  6b65D-02 
.65373 18853278784040-02 
•7262482200130233690-02 
. 7046662 129 126097810-02 
•4026673278450604350-02 
.3934756060269103560-02 
•4135676521621388850-03 
•4056252933437162610-03 


I-(6K)* 


gg 

g gg  g 

gg  gg 

I g gg  gg 

^ gg 

gg  gg  gg 

6 gg  gg  ;g, 

gggg  g ggg 

I gg  gg 

4g 

gggg  gg 

gg  gg 

ggg  g s 

ggg  g g 

6g  g 

gg  g 

gg 

gg 
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TBANSPEB  FUBCTIUN  FBUH  FUBCB  AC'FUAl'OBS  10  POSinutl  SBNSOBS 
NOHXMAL  DATA:  BBLAriVB  DAnPlHG  Z£IA  . lUUUUUUOOUuGUOUOOOU-U  1 
IQG  DEBIGN  iAnBD  TO  OFTIHAL  bTABUlTI  HABGIN  l-lMV(Ka);  EPSILOH 


. lUUOOOUUUOUOUOUOOOlH’OO 


HU  4 = 
HU  b » 


. 104500696377530  jlOB+00 
. l37H550Bb1736l4496i)«0u 
. 1B70B552005U9166400«00 
. i;!l2321tn32055879bb'»0u 
.1 19169 15050037 4b bbbtOO 
. 1159248047 l25b922JD*00 
. 1134960 12 1912937b7b«00 
. 110644592 146487405U400 
. 109472272 1 1081 76200400 
. 106677577  956 1104260400 
. 1046449189190092290400 
. 1023249566444379410400 
.79394 16 139 1b0o52920-01 
.7o8b4U427959987789b-01 
.5427403260204451390-01 

. 34082  64 12 1 25  6806  5jU-0 1 
.2716426013134172570-01 
.2660786461310799400-01 
.1358525983123000370-01 
. 133113956411 1037590-01 
.1311463759910452690-02 
. 1285997867086809640-02 


IBANSFBB  FUBCTIOB  FBOH  FOKLt  ACTUAtOHh  TO  PublTIOB  SHHSOnS 
MOHINAL  OATA:  B&LATIV £ 0AHP18S  Z6TA  = .1000000000000000006-01 

LUG  OESIGH  I0H£O  TO  OPXINAL  GTABiLiTl  HABGXH  X-X8V (KG|  ; 8P8XLU8  - 


HU  1 
HU  2 * 
HU  3 o 


.2758897910  105729746-01 
.26  2213497076  06  33450-01 
.2456062950923645010-01 
. 237bB  27  IS  504  8280830-0 1 
.23747 1 174032  6143390-01 
.228749740256 1007910-01 


.1000000000006000000-01 


.2109871988108690480-01 
.20  4 13  08  57284  7588240-0 1 
.1964064217525673190-01 
. 1687960307936304930-01 
. 15  7 12  32  99  0 12  86  9b  390-0 1 
.1239978009916668410-01 
.118089892718  79  54  380-01 
.8842046234410409250-02 
.8537318853277496550-02 


6 && 
& 66 
&£ 

66 

e 

66 

66 


TBABGFtM  PUMCTXUB  FBOH  FOBCB  ACTUATOBS  TO  POSITXOII  SBH60BS 
FBBTUB8B0  DATA:  BELATIVE  MHPING  ZEIA  > .1OO0O000U0OOOOO0OOO-O1 
LOG  OBsIgM  tOI&D  to  OFTIHAL  STABUITT  HABGIH  I-LHV  KG)  ; BFSILOH  > 

1046044451469824730400  T *6 

1003161936571399510*00  *8 

44855433996020-01  66 

ia?fia’iTO^f8?2^8=8^  6"*^  6. 

6981936999165364940-01  66  6 

8850340  951402951 100-0 1 

28891883000679010-01  6 *t 

68955125441288300-01  66  6 

0605890807715860-01 

170U91298098042r20-01  66 

6956815497604935630-01  66 

*■  97050-01  66 

29155826905760-01  66 

6510831167448410-01  66 

1402258060588590-01  66 

20  07M  582  54  56  430-0 1 
254123251717232320-01 
1624768659822061720-01 
15870  39  77  346  6072  950-0 1 
1503495455178210 

HO 23  > .j4lo3Ssl34iS37 94820-03  I 

H024  • .7284968097999418100-03  I 


. 1000000000000000000*00 


*6 

•6 

6 *6 

•6 

•6  6 

*6 


TBANSFBB  FUHCTIOII  FBOH  FOM-'E  ACTUATOBS  TO  FOSITXOi  SEHSOBS 
FEBTUBilEO  OATA:  ilBLATlVE  DAHFING  ZETA  ° .lOOOOOOOOOOOOOOOOOlB^I 
LOG  OKilGH  TUlio  TO  OFTIHAL  StABXLItl  HABGIH  X-1H4  (KG)  ; BFSILOH  > 


. 1000U0000000000000O-O1 


.2166536386182275310-01 
.2017466061661002150-01 
. 1952029856755207400-01 


HU  8 > 
HU  9 > 
HU  10  > 
8011“ 
HU  12  » 
H0 13  > 
HU14  < 
HO  15  > 


19 127  11<te  267  610  8 560-01 
18837202171406  38330-01 
18694 18 17643 5878520-01 
18367971>0675  7484 110-01 


06  38330-01 
5878520-0 


175195127  7 90  56  3 2 200-01 
1731949126662283070-01 
1718731 70 6926558550-01 
1561955913370956470-01 
1462598014718757650-01 
1169417382434638910-01 
1117035965816931070-01 
8650922229957750790-02 
8358929823474522050-02 
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CONTROL  OF  LARGE  SPACE  ANTENNAS  BASED  ON 
ELECTROMAGNETIC-STRUCTURAL  MODELS 

Massih  Hamidi  and  Farzin  Manshadi,  Jet  Propulsion  Laboratory, 

California  Institute  of  Technology 
Pasadena,  CA  91109 


ABSTRACT 

A general  approach  to  the  optimal  control  of  large  space  antennas  based 
on  their  RF/structural  characteristics  is  described.  The  approach  consists 
of  defining  a cost  functional  based  on  the  degradation  of  the  RF  performance 
of  the  antenna  and  using  the  structural  model  as  the  dynamic  system.  The  method 
is  applied  to  the  design  of  an  optimal  controller  for  a 55-m,  wrap-rib  offset- 
fed  antenna.  The  controller's  goal  is  to  minimize  the  variations  of  the  peak 
electric  field  of  the  antenna  due  to  feed  displacements. 

INTRODUCTION 

The  control  of  the  RF  properties  of  antennas  is  traditionally  achieved  by 
implementing  and  following  a set  of  geometrical  considerations  such  as 
pointing  accuracy,  shape  distortion,  feed  position,  stability,  etc.  These 
geometrical  requirements  are  very  stringent  because  of  the  way  in  which  they 
are  conceived.  The  relative  importance  of  different  parameters  of  a geo- 
metrical configuration  in  RF  performance  degradation  usually  can  not  be 
determined.  Each  parameter  is  therefore  considered  separately  and  uniform 
criteria  are  derived  which,  in  order  to  meet  a worst-case  situation, 
lead  to  very  restrictive  conditions.  This  study  alms  to  use  the  RF  performance 
degradation  directly  as  the  means  of  defining  the  controller,  and  thus  to 
circumvent  the  stringency  of  the  geometrical  methods  . 

This  paper  is  organized  into  three  sections.  The  first  describes  the 
methods  used  for  the  computations  of  the  "far-field"  electromagnetic  proper- 
ties of  the  antenna  and  for  the  derivation  of  the  "sensitivity"  of  these 
properties  to  different  parameter  variations.  The  second  section  briefly 
discusses  a general  approach  to  determining  quadratic  cost  functionals  based  on 
the  RF  characteristics  of  the  antenna  system.  This  reduces  the  RF  optimal 
controller  design  to  a subclass  of  the  linear  quadratic  optimal  controller 
design  probleim  whose  solution  is  well-known.  To  illustrate  the  method,  the 
design  of  an  RF  optimal  controller  for  the  55-m,  offset-fed,  wrap-rib  antenna 
considered  in  the  LMSS  [Ref.  1]  project  is  presented  in  section  3.  The 
goal  of  the  controller  is  to  minimize  the  variations  of  the  peak  electric 
field  of  the  antenna,  caused  by  feed  displacements. 

RF  MODEL  DEVELOPMENT 

Consider  the  reflector  antenna  shown  in  Figure  1.  The  reflector  surface 
E is  constructed  by  intersecting  a circular  cylinder  with  an  arbitrary  curved 
surface.  The  cylinder  axis  is  taken  as  the  z axis;  the  x-y  plane  is  perpen- 
dicular to  the  z axis.  The  projection  of  E on  the  x-y  plane  is  the  circular 
region  a with  radius  a.  The  feed  is  assumed  to  be  located  at  some  arbitrary 
point 
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Figure  1.  Offset  Shaped  Reflector  Antenna 


The  far-field  approximations  for  the  scattered  electric  and  magnetic 
fields  of  the  offset-fed  antenna  are  readily  evaluated  using  a technique 
developed  by  Y.  Rahmat-Samii  and  V.  Galindo-Israel  [Ref.  2] . A brief  summary 
of  this  technique  is  presented  in  the  following  paragraphs. 

The  electromagnetic  fields  E and  H are  given  by 


H = V X A 


E 


1 

j0)£ 


V X H 


A is  the  vector  potential  expressed  as: 


A = 


-jk  r-r ' 
e ' 

4it  I r-r  ' I 


ds  ' 


where  J is  the  induced  current  on  the  reflector  surface  due  to  the  magnetic 
field  Hs  radiated  by  the  source.  It  is  given  by  [Ref.  3] 


J = 2n  X H 

s 

where  n is  the  unit  vector  normal  to  the  reflector's  surface  and  directed 
towards  the  positive  direction  of  the  z axis. 


Introducing  the  far-field  approximations  in  the  expressions  for  E and  H, 
one  obtains  [Ref.  2] 


” j lex* 

■ j''  TSS-  e - T0  i)  + 0(r‘b 

- - -2 

[e  =-Jkn  (Tg  0 + T^  <^)  + 0(r  ) 
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where  ti  = /—  , the  parameters  (r,0,<f))  are  the  coordinates  of 

A t ^ 

point , 6 and  (j>  the  unit  vectors  corresponding  to  0 and  (j> , and 
integral  given  by 


the  observation 
T the  radiation 


T = 


J(r')  ds' 


It  has  been  found  that  the  radiation  integral  provides  a very  accurate 
solution  for  predicting  the  far -field  radiations  of  reflector  antennas.  The 
necessity  of  having  an  efficient  technique  for  the  evaluation  of  this  integral 
stems  from  the  fact  that  it  has  to  be  computed  accurately  each  time  the  observa- 
tion angles  change.  Moreover,  the  integrand  of  the  radiation  integral 
oscillates  rapidly  and  thus  makes  the  integration  more  strenuous 
for  large  reflectors. 


To  circumvent  the  difficulties  in  the  integration,  the  radiation  integral 
is  first  expressed  in  terms  of  a summation  of  Fourier  Tr.ansforms  of  an 
"effective"  aperture  distribution 


p-Xx> 


T = I G (0) 
p=0  P 


2n 


fa 


0 J 


0 


where  B and  $ are  functions  of  the  observation  angles  0 and  (j) . 

The  Fourier  Transform  integrals  are  then  expanded  in  terms  of  Jacobi-Bessel 
series  by  writing  that 


->co  ->oo 


Q (as',<j)')  = I ^ [ C „D  ] 

P n=0  m=0  P 


cosn(|) ' 
sinn(}) 


where  F (.)  are  the  modified  Jacobi  polynomials  defined  by 


m 


F '^(x)  = v/2(n+2m+l)'  P ”-(l-2x^)  x’^ 
m m 


/ ^ s m r*  /-V 

Sr  Ki-xb”(i~x)^] 

dx 


and  the  coefficients  _C  and  are  given  by 

P nm  P nm 


P*^nm 
P nm 
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2ir 


y 2tt  ^0  JQ  1 


1 n=0 

2 n?^0 
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^ F (s')s'ds’d<j)' 
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Finally,  the  radiation  integral  is  expressed  as 


T 


27ra^ 


p-X»  ^*^co 

I G (0)  I I j" 

p=0  ^ n=0  m=0 


[ c 

p nm 


D ] 
p nm 


cosn$ 

sinn$ 


/ 2 (n+2m+l )' . 


V2m+1^^^^^ 

kaB 


where  J (.)  indicates  the  Bessel  function  of  order  n. 
n 

The  method  has  several  important  features  which  may  be  summarized  as 
follows : 


1)  Higher  order  coefficients  pCnm  P^nm  ^^.n  be  calculated  from  zero 
order  coefficients  oCnm  oDiun  and  by  use  of  recursion  relations. 

2)  Once  pCnm  and  pDnm  are  determined  they  can  be  used  for  all  observa- 
tion angles . 

3)  The  numerical  integrations  involved  in  the  computation  of  oCnm  and 
oDnm  do  not  contain  the  highly  oscillatory  Fourier  Transform  kernel  of  the 
original  expression. 

To  illustrate  the  variation  of  the  RF  pattern  of  a reflector  antenna  as 
a function  of  its  feed  location,  the  far  fields  of  a 55-m  parabolic  reflector 
antenna  are  plotted  for  four  different  feed  locations:  (1)  feed  at  focal 

point,  (2)  feed  displaced  in  the  x direction  with  y and  z constant. 

(3)  feed  displaced  in  the  y direction  with  x and  z constant,  and  (4)  feed 
displaced  in  the  z direction  with  x and  y constant.  (Figures  2 through  5). 
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MAGNITUDE  (DB) 


As  can  be  noticed,  displacements  along  the  x and  y axes  produce  signifi- 
cant variations  in  the  gain  while  displacements  along  the  z axis  have  almost 
negligible  effects.  It  can  thus  be  observed  that  to  optimize  the  gain  most 
of  the  effort  should  be  concentrated  on  controlling  the  feed  displacements 
along  the  x and  y directions  and  that  the  control  along  the  z direction  can 
be  relatively  loose. 


COST  FUNCTIONAL  DERIVATION 

Let  P be  an  RF  performance  parameter  to  be  optimized  e.g.,  gain,  band- 
width, magnitude  of  the  electric  or  magnetic  field);  Let  xq^  , X2 , . . . 
be  the  independent  variables  whose  variations  affect  P:  P = PCx^,  X2 , ...  xn) ; 
and  let  x = (x-j^  X2  ...  Xn)"'-  . Suppose  P assumes  its  optimal  value  at 
XQ  = (^10  ^20  •••  ^no)^>  Around  this  point  P can  be  expressed  as 


P(x)  = P(x^)  + VP 


X 

o 


1 T 
Ax  + y Ax 


Ax  + . . . 


where  P and  H denote  the  gradient  vector  and  the  Hessian  matrix  of  P with 
respect  to  x and  Ax  = x - xq . 


Since  P is  optimum  at  x^ 


VP 


X 

0 


= 0 


and  H 


is  positive  definite  or  negative  definite  depending  on  whether  P 

r\ 


xs 


minimum  or  maximum  at  x 

r 


Hence,  around  the  given  point, 

P(x)  = P(x^)  + ^ Ax^  H Ax  + ... 

1 1 T I 

and  we  can  approach  the  optimum  value  of  P by  minimizing  Ax  HAxl . Note 
that  if  this  minimum  reduces  to  zero, we  actually  attain  the  optimum  value  of  P. 
Note  also  that  if  H > 0, 

I 1 T I IT 

Ax  H Ax|  = -^  Ax  H Ax 


and  if  H < 0 

Ax"^  H Axl  = j Ax^  (-H)  Ax 
and  that  in  both  cases, 

Ax'^  H Ax|  = ^ Ax'^  A Ax 


where  A is  a positive  definite  matrix. 
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The  antenna  is  modeled  by  an  equation  of  the  form 
My  + Ky  = Bu  y(t^)  = y^, 

It  is  always  possible  to  relate  Ax  to  y by  an  equation  of  the  form 
Ax  = Ty „ Hence,  minimizing 


A Ax 


and  reducing  it  to  zero  is  equivalent  to  minimizing 


1 

2 


T T 

y T ATy  dt 


We  can  thus  formulate  a linear  quadratic  optimal  control  problem  by  writing  , 
minimize 


1 

2 


(y^l'^ATy  + u'^Ru)  dt 


subject  to 


My  + Ky  = Bu 


y(t  ) = y. 


y(t^) 


The  term 


T 

u Ru  dt 


^ 0 


where  R is  a positive  definite  matrix,  is  added  to  account  for  the  restrictions 
in  energy  consumption  for  the  control. 

t 

The  rest  of  the  procedure  is  classical,  A damping  term  Dy  is  added  to 
the  equation,  and  the  system  is  augmented  by  considering  the  state  vector 
W = (y  y)"^-  This  leads  to  the  system 


W 


0 

-m“^k 


-m“^d , 


w + 


0 

m“^b 


u = FW  + Gu 


associated  with  the  cost  functional 


2 J 


(W^QW  + u'^Ru)  dt 


where 


/ T^AT  0 ' 

I 0 
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The  optimal  control  is  given  by 
u(t)  = - CW(t) 

with 

“1  T 

C = R G K 

where  K is  the  positive  definite  solution  of  the  Riccati  equation 
+ k f + q - k g'^r'^g  K = 0 


FEED-DISH  MOTION  COMPENSATION  FOR  A 
55 -m,  WRAP -RIB,  OFFSET -FED  ANTENNA 

To  illustrate  the  application  of  the  method,  in  the  sequel  we  describe 
the  design  of  an  RF  optimal  controller  for  the  55-m,  of f set-fed, wrap-rib  antenna 
considered  in  the  LMSS  [Ref.  1]  project.  The  controller  is  designed  to  minimize 
the  relative  feed-dish  motion  of  the  antenna. 

A schematic  diagram  of  the  antenna  is  given  in  Figure  6.  It  is  composed 
of  a 55-m  diameter  reflector  dish,  a massive  feed  array,  and  a long  L-shaped 
boom  connecting  the  dish  and  the  feed.  The  antenna's  operation  frequency  is 
871  MHz  which  leads  to  the  values  of  159. 68A  for  the  dish's  diameter  and 
239. 6A  for  the  focal  length  (A  is  the  wavelength  at  871  MHz). 

Construction  of  the  State  Cost  Matrix 

Let  rQ  be  the  vector  from  the  center  of  gravity  of  the  spacecraft's  bus 
to  the  boom's  elbow  and  r±  the  vector  from  the  elbow  to  the  center  of  gravity 
of  the  reflector's  dish.  The  position  of  the  feed  is  characterized  by  the 
vector . 


FEED 


(Spacecraft's  bus)  Figure  6.  Schematic  Diagram  of  the  Antenna 
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Let  Sq  = (xq  Yq  Zq)”^  be  the  position  for  which  the  electric  field  is 
maximized.  For  the  neighborhood  of  this  point 


E = E(s  ) + V E 
o s 


AS  + ^ H(s  ) AS  + ...  = E(s  ) 
z o o 


1 T 

+ ^ AS  H(s  ) AS  + 
z o 


since 


V E 
s 


= 0 


The  value  of  the  electric  field  was  evaluated  using  the  algorithm 
described  in  the  first  section  of  this  paper  for  different  feed  positions.  The 
following  Hessian  matrix  H(sq)  was  obtained  through  numerical  differentiation  of 
the  electric  field  (as  a function  of  feed  position) : 


/ -26.0966 

- .00344 

- .0076 

H(S^2  = 

( 

- .00344 

1 

-24.231 

6.396 

V-  ,0076 

6.396 

-1.96 

To  evaluate  AS,  we  interpret  the  feed-dish  motion  by  considering  the 
feed  and  the  lower  boom,  r^,  fixed,  and  the  dish,  hence  rp,  rotating  about 
the  coordinate  axes  . Thus  , 


As  = Ar^ 

The  antenna's  distortions  are  supposed  to  be  small.  This  leads  to 

Ari  = r^  X r 

where 

r = (Y],  Y2  Y3) 

designates  the  rotation  of  about  the  x,  y and  z axes  of  the  inertial  frame. 
Writing  Ar]^  in  a matrix  form,  we  obtain 


with 


As  - 

D r 

r 

+c 

-b 

D = 

-c 

0 

+a 

V 


-a 


where  a,  b,  and  c are  the  components  of  the  vector  r^:  r^  = (a  b c) 
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Thus , 

ITT  AIT 

AE  = E - E(S  ) = ^ r D HDT  = ^ T HT 
o z z 

H'  = D^HD 


E has  a maximum  at  S . H is  therefore  negative  definite  and  so  is  H'. 
Consequently,  the  state  cost  is  given  by 

|ae1  = J r^(-H')r 

Antenna  Model 

The  antenna  is  modeled  by  the  linear  system 
My  + Ky  = Bu 


where  the  state  vector  y has  12  components  as  follows:  yj^  to  y3  represent  the 
attitude  0y,  and  0^  of  the  spacecraft;  y^  to  y^  the  rotations  Y2 , and 
Y3 ; and  y-j  to  yj!2  the  six  most  important  modes  of  the  reflector  dish. 

The  values  of  the  matrices  M,  K,  and  B are  listed  in  the  appendix.  The 
system  has  a damping  of  zero  on  the  first  three  components  (rigid  body  motions) 
and  0.005  for  the  rest. 


Thus, the  state  cost  weighting  matrix  takes  the  form 


^a-3)(j-3) 


= 0 


4^i^6, 

elsewhere 


Optimal  Feedback  Computation  and  Simulation 

The  OPTSYS  [Ref.  4]  program  package  was  used  to  determine  the  optimal  gain 
for  the  case  where  a three-dimensional  control  is  applied  at  the  spacecraft's 
bus  . The  control  cost  weighting  matrix  was  taken  to  be  the  3x3  identity 
matrix.  The  resulting  gain  matrix  is  also  given  at  the  end  of  this  paper. 

The  optimal  feedback  control  thus  obtained  was  used  to  drive  the  antenna 
system  in  simulation  studies.  The  results  are  given  in  Figure  7. 


CONCLUSION 

The  RF  performance  of  large  flexible  antennas  is  traditionally  achieved  by 
imposing  stringent  geometric  restrictions  on  the  structural  distortions  from  a 
nominal  optimum  configuration.  In  this  paper,  we  have  presented  an  approach 
to  alleviate  the  stringency  of  the  geometrical  criteria  of  satisfactory 


190 


KUTA  I iON  ABOUT  ROTATION  ABOUT 

Y X 10"^  X X 10"^ 


0.00  100  200  300  400  500 


TIME,  sec 


00! 1 1 I I I 

0.00  100  200  300  400  500 


TIME,  sec 


Figure  7.  Feed  Dish  Motion  Variation  with  Time 
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performance » The  approach  consists  in  generating  a linear  optimal  control 
problem  with  quadratic  cost  functional  where  the  cost  functional  is  obtained 
from  the  RF  characteristics  of  the  antenna  and  the  dynamic  system  constraint 
is  given  by  the  structural  model  of  the  antenna. 


The  method  was  applied 
wrap-rib,  offset-fed  antenna 


to  the  feed -dish  motion  compensation  for  a 55-m, 
and  the  time  simulations  presented. 
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APPENDIX 
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^ni  « 

9F  Vl»  « 

f»ni  « 

C0L  3 

CCL  4 

COL  5 

At  4, 

i,ww  7 

ROH 

S 

3,9633086^08 

9,9905909^03 

2«6I0029460| 

2,3171954605 

O.OOOOOOO 

0,0000900 

-486297059-03 

ROW 

i 

9,9905989#OS 

3,5137213606 

•9,9635939*05 

1,9069429603 

1,0663955605 

-3,9865978*06 

■5,7304527*01 

3 

2•6i00244«91 

ffl9,96359S9*0S 

S,6325g@9605 

•2,6580818602 

94,4156145604 

8,3388207*05 

3,0739090*02 

295173954^05 

1,9069429603 

»2, 6580818602 

4,1819109605 

O.OOOOOOO 

0,0000000 

«3,©470785-OS 

s 

1,0663955601 

0,0000909 

»4e55iSS26*66 

a| e It66?95*©2 

ROW 

- '6 

0,0000000 

4.3388207605 

0,0009008 

94,5515826604 

4,6328949605 

3,0739090*02 

? 

95,7304527601 

3,0739090602 

95,0470785-03 

-1,1266795602 

3,0739090602 

1,0000009*00 

ROW 

8 

8e9T86856«0| 

®?,3S9287S60g 

282740119602 

•1, 3301117601 

-4,9267653600 

2,4751272602 

1,1119753-16 

'6.8179oeo«@i 

9,6671360601 

»2a8633@S160i 

98,5794042601 

6,4663564-0! 

-3, 2596941*01 

6,0900000 

sa 

«4,&663060^0S 

o| ,6841067600 

®S*  S3609S6oO| 

93,0711672602 

»|»0826i76e04 

2,9556953-04 

ROW 

H 

o!,37t647S«03 

»!, 6970841601 

•7,5011134600 

-2,0684245602 

-2,4574875*01 

-7,8701537*00 

0,0000000 

ROW 

iS 

|,eo62040«9! 

"I ,2870787602 

-5,8049150601 

2,7459521601 

-1,8688063*02 

-5,9918236*01 

6^9090455-18 

eOL  9 

COL  10 

COL  11 

COL  12 

ROW 

! 

6,6179000692 

94,6663060602 

91.3716475602 

1.8062040601 

ROW 

2 

9.6671360691 

91,6841067600 

91,6974841601 

91,2070787602 

ROW 

S 

«2. 6633031691 

93,1364936-0! 

"7,5011154600 

95,8040150601 

ROW 

4 

Be,3794Q42«0l 

95,0711672602 

•2.0884245602 

2,7459521601 

ROW 

S 

6.4663364«9I 

91,0826176-04 

-2,4574875601 

•1,8688063602 

ROW 

8 

•3,2596941a91 

2,9536933-04 

-7,0701537600 

95.9918236601 

ROW 

7 

O.OOOOOOO 

e4,9897l00«07 

O.OOOOOOO 

6.9498455918 

ROW 

S 

3.26339as»ii 

0,0000009 

-1,6299311-06 

2.8158278-03 

ROW 

R 

9,696435l«9i 

9*0000009 

-2, 8158277-03 

•t,6299488-06 

ROW 

i® 

0,0090909 

9«497084S»01 

97,2874412-12 

9,1993014-13 

ROW 

t f 

m • 

e2,9!582?7s93 

e?92«?sai29!2 

9,9973708*01 

99s6?S6327e!S 

ROW 
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stiffness  matrix  is 
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ROW 
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4 
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5 
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4 
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O.OOOOOOO 
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8 
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0,0000000 
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9 
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10 
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T 
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1.0  Introduction 


In  order  to  assess  the  performance,  sensitivity,  and  hardware  requirements 
of  the  various  active  structural  control  methods  which  are  being  developed,  a 
universal  system  model  is  required.  This  report  contains  a complete  description 
of  ACOSS  Model  #2  which  is  a simple  but  realistic  evaluation  model.  The  design 
of  this  system  was  driven  by  the  desire  to  incorporate  certain  attributes  into 
the  overall  system  characteristics.  The  desired  features  were 

• structural  design  based  on  realistic  sizes  and  weights 

fl  a simple  unclassified  optical  system  with  associated  performance 
measures  and  tolerances 

i a set  of  disturbances  typical  of  equipment  vibration  and  attitude 
control  (slew). 

The  resulting  model  is  described  in  detail  in  section  2.Q  along  with  the  result- 
ing mode  shapes  and  natural  frequencies  of  the  system.  Section  4.1  contains  a 
description  of  the;:l ine-of-sight  (LOS)  performance  measure  including  theory  and 
implementation. 

2.0  Structural  Design 

ACOSS  Model  #2  is  shown  in  Figure  1.  It  consists  of  two  subsystems.  The 
optical  support  structure  and  the  equipment  section,  which  are  connected  by 
springs  at  three  points  to  allow  either  passive  or  active  vibration  isolation. 

The  optical  support  structure  contains  the  four  optical  surfaces  which  are 
assumed  to  be  a rigid  central  section  with  two  flexible  solar  panels  cantilevered 
from  it. 

2.1.  Optical  Support  Structure 

The  optical  support  structure  consists  of  the  upper  mirror  support  truss, 
the  lower  mirror  support  truss  and  the  metering  truss  which  maintains  mirror  sep- 
aration. The  finite  element  model  of  this  subsystem  is  shown  in  Figure  2,  The 
model  contains  35  node  points  and  117  beam  elements.  The  structure  is  designed 


Figure  1.  ACOSS  Model  No.  2 
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Figure  2.  ACOSS  Model  No.  2 - Finite  Element  Model  (Continued) 


to  act  as  a truss  but  it  is  assumed  that  all  joints  allow  a full  moment  connec- 
tion. Thus,  both  bending  and  axial  stiffness  are  included  for  all  members.  All 
structural  elements  are  assumed  to  be  hollow  graphite  epoxy  tubes.  Four  differ- 
ent sizes  of  tubes  are  used  in  this  model  and  their  dimensions  and  section  prop- 
erties are  listed  in  Table  1.  The  optical  surfaces  are  assumed  to  be  rigid  and 
kinematically  mounted  on  the  structure  so  that  they  will  have  no  effect  on  the 
overall  stiffness.  The  mass  of  the  mirrors  will  increase  the  inertia  of  the 
system  and  has  been  included  in  the  model.  The  total  mass  of  this  subsystem, 

5084  kg  which  includes  the  structure,  optical  surfaces,  and  equipment,  has  been 
lumped  at  18  node  points  as  shown  in  Figure  3. 

2.2.2  Equipment  Section 

^ The  equipment  section  is  modelled  as  a central  rigid  body  with  two  flexible 
solar  panels  cantilevered  from  it.  The  finite  element  model 'is  given  in  Figure  4. 
It  has  been  assumed  that  the  rigid  body  portion  contains  all  of  the  equipment, 
guidance  and  navigation,  control  systems,  power  supplies,  that  are  necessary  for 
operation  of  the  system.  Therefore,  a relatively  large  portion  of  the  total  mass, 
3500  kg,  is  concentrated  in  the  rigid  section  at  node  44.  Also  included  at  node 
44  are  the  rotational  inertias  due  to  distribution  of  the  mass  over  the  triangu- 
lar area.  These  inertia  terms  are  listed  in  Figure  7. 

The  solar  panels  are  assumed  to  have  an  area  of  120  m each  and  are  sup- 
ported by  graphite  epoxy  support  booms.  The  mass  and  rotational  inertia  of  the 
solar  panels  are  lumped  at  three  nodes  on  each  boom.  The  lumped  masses  are 
listed  in  Figure  4. 

2.2.3  Isolation  System 

The  optical  support  structure  and  the  equipment  section  are  connected  by  a 
passive  isolator  at  three  points  by  springs  acting  in  all  three  translational 
directions.  The  isolator  is  designed  to  reduce  the  amount  of  force  transmitted 
from  the  equipment  section  to  the  optics.  This  is  accomplished  by  designing  an 
isolator  with  the  natural  frequency  much  lower  than  expected  disturbance  frequen- 
cies. In  addition,  the  isolator  frequency  should  be  lower  than  the  optical  ■ 
structure  bending  frequency.  A passive  isolator  consists  of  a spring  and  a vis- 
cous dashpot  in  parallel  as  shown  in  Figure  5. 
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Figure  5.  Passive  isolator. 
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Table  1:  Member  properties 


TYPE 

200 

TYPE 

300 

20  cm 

dia.  X . 

1 cm  Round  Tube 

20 

cm 

dia  X 

. 05  cm 

Round  Tube 

A = 

6.250  X 

lO”'*  m^ 

A 

ss 

3.133 

X 10"^ 

m2 

I = 

3.075  X 

10-®  m^ 

I 

= 

1.559 

X 10"® 

m^ 

J = 

6.1,89  X 

lO"®  m^ 

J 

s 

3.118 

X 10"® 

m^ 

TYPE 

400 

TYPE 

500 

25  cm 

dia  X .05  cm  Round  Tube 

40 

cm 

dia  X 

.075  cm  Round  Tube 

A = 

3.919  X 10“^  m2 

A 

ss 

9.407 

X 10"^  m2 

I = 

3.0496  X 10“®  m^ 

I 

= 

1.874 

-5  4 

X 10  m^ 

J = 

6.099  X 10"®  m^ 

J 

sst 

3.749 

-5  4 

X 10  ^ m^ 
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EQUIPMENT  SECTION  S SOLAN  PANELS 


NODE  LOCATIONS  (m) 

N<M)E 

X 

y 

2 

42 

0.0 

5.0 

0.0 

43 

-2.0 

0.0 

0.0 

44 

0.0 

0.0 

0.0 

45 

2.0 

0.0 

0.0 

46 

-4,0 

-5.0 

0.0 

47 

4.0 

-5.0 

0.0 

48 

-26.0 

0.0 

0.0 

49 

-21.0 

0.0 

0.0 

SO 

-16.0 

0.0 

0.0 

51 

-11.0 

0.0 

0.0 

52 

-6.0 

0.0 

0.0 

53 

6.0 

0.0 

0.0 

54 

11.0 

0.0 

0.0 

55 

16.0. 

0.0 

0.0 

56 

21.0 

0.0 

0.0 

57 

26.0 

0.0 

0.0 

EQUIPMENT  SECTION  - ELEMENT  CONNECTIONS 

- NODES  42-47  RIGID  PLATE 

- SOLAR  PANEL  BOOMS 
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NODE  A 

NODE  B 

PROPERTY  # 

131 

48 

49 

500 

J32 

49 

50 

133 

30 

51 

134 

51 

52 

133 

52 

43 

136 

45 

33 

137 

53 

54 

138 

54 

35 

3 39 

55 

56 

140 

56 

57 

> 

k 
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Figure  4.  Equipment  section  and  solar  panels 


If  the  isolator  frequency  is  lower  than  the  lowest  natural  frequencies  of  the  two 
sections,  they  can  be  modelled  by  their  inertia  properties  only.  For  this  sys- 
tem, the  isolator  was  sized  to  have  a frequency  of  0.5  Hz.  The  dashpot  size  has 
not  been  specified  as  it  is  assumed  to  be  part  of  the  overall  control  design. 

The  spring  constants  are  given  in  Table  2. 

3.0  Structural  Analysis 

A model  analysis  was  performed  using  the  NASTRAN  finite  element  program. 
This  model  contains  84  dynamic  degrees  of  freedom,  thus  84  frequencies  and  mode 
shapes  were  extracted.  Table  3 gives  a list  of  frequencies  and  descriptions  of 
the  first  fifty  mode  shapes, 

4.0*  System  Performance  Evaluation 

The  performance  of  this  system  is  measured  by  the  ability  to  maintain  line- 
of-sight  (LOS)  rotation  and  defocus  within  specified  tolerances.  By  making  some 
simplifying  assumptions  a set  of  linear  equations  relating  the  LOS  rotations  and 
defocus  to  the  displacements  of  node  points  in  the  finite  element  model  have  been 
developed.  This  results  in  a performance  evaluation  model  which  has  the  same 
level  of  accuracy  as  the  finite  element  model. 

4.1  ^ LOS  Error  Algorithm 


The  optical  system  used  in  ACOSS  model  #2  is  shown  in  Figure  6,  The  pri- 
mary and  tertiary  mirrors  are  off  axis  sections  of  rotational ly  symmetric  coaxial 
surfaces.  The  secondary  mirror  and  the  focal  surface  are  flat.  We  assume  that 
the  mirrors  maintain  their  nominal  shape  and  that  their  motion  is  a linear  func- 
tion of  the  displacements  of  the  nodes  at  the  support  points.  Using  these  as- 
sumptions and  first  order  optics  equations,  expressions  relating  mirror  transla- 
tions and  rotations  to  the  translation  and  rotation  of  the  image  are  given. 
Translations  and  rotations  of  the  mirrors  are  assumed  to  be  the  displacements  of 
the  vertices  of  the  surfaces,  located  on  the  optical  axis  (Figure  7).  The  dis- 
placements of  the  vertices  can  be  extrapolated  from  the  displacements  of  the  nouc 
points  at  the  mirror  supports. 


The  general  form  of  the  expressions  for  LOS  rotation  and  defocus  is 


LOSX  = 

Y/F 

LOSY 

= X/F 

DEFOCUS  = 

z 

(4-1) 

Where 

X - [-Xp 

- 

Rp  OVp  ^ 

^1 

- 2 Y J] 

(4-2) 

2T-R^ 

< 

II 

1 

< X 

c+ 

- + 
+ Y + 

0-fYi 
^ - 

20Xs  ^+t^ 

+ 20XJ] 

(4-3) 

'p 

2T-R, 

- Yf  - 

ez^Xj 
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Table  2.  Isolator  spring  connectivities 


SPRING  A 

NODE  A 

DOF 

NODE  B 

DOF 

1 

4 

1 

42 

1 

2 

4 

2 

42 

2 

3 

4 

3 

42 

3 

3 

1 

46 

1 

5 

3 

2 

46 

2 

6 

3 

3 

46 

3 

7 

6 

1 

47 

1 

8 

6 

2 

47 

2 

9 

6 

3 

47 



3 

K - 5790  N/m 
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Table  3.  Frequencies  and  mode  shapes 


Description 


rigid  body 
Isolator  Y-rotation 
Isolator  z~rotation 
S.P.  X-Z  plane 
S.P.  X-V  plane 
Isolator  Z-trans 
Isolator  Y-trans 
Isolator  X-traris 
2"’^  S.P.  X-Z  plane 
2"^  S.P.  X-y  plane 
Isolator  X-rotation 


FREO(HZ) 

DESCRIPTION 

FREQ(HZ) 

DEBCKIPTIOM 

26 

8.U 

LEG  TORSION 

1 39 

15.65 

LEO  1 LONER  TRUSS  BENDING 

29 

6.36 

3*^*^  S.P.  TORSION 

40 

16.07 

LEO  BENDING 

30 

8,57 

LEG  TORSION 

41 

16.52 

UPPER  TRUSS  BENDING 

31 

8.81 

3*^**  S.P,  X-Y  PLANE 

42 

16.75 

UPPER  TRUSS  BENDING 

32 

0.81 

3*^^  S.P.  X-E  PIANE 

43 

17.16 

UPPER  AND  LONER  BENDING 

33 

11.35 

4^^  S.P,  X-Y  PLANE 

44 

17.03 

STRUCTURAL  BENDING  X-Y 

2A 

11,50 

LEG  BENDING 

45 

19.07 

LOWER  TRUSS  BENDING 

35 

12.73 

LEG  .BENDING 

46 

23.77 

UPPER  TRUSS  BENDING 

36 

13.58 

4^^  S.P.  X-Z  PLANE 

47 

24.41 

UPPER  4 LONER  BENDING 

37 

13,71 

LEG  BENDING 

4fi 

25.91  I 

UPPER  TRUSS  AXIAL 

38 

14.16 

LEG  4 liWER  TRUSS  KUDING 

49 

1 

26.36 

UPPER  TRUSS  BENDING 

50 

26.43 

S.P.  AXIAL 
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Figure  7.  Three-mirror  system 
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(4-4) 


+ 0Y^  • Xj  - 0Xf  • Yj 

the  terms  X^ , Y^. , Z^ , 0X^. , 0Z^.  refer  to  the  translations  and  rotations  in  the  glo- 
bal X,  Y,  and  Z directions  of  the  primary  (p)  and  secondary  (s),  tertiary  (t), 
and  focal  plane  (f).  The  variables  in  the  equations  which  relate  to  the  size  of 
the  optical  system  are 


radius  of  curvature  of  the  primary 

radius  of  curvature  of  the  tertiary 

axial  distance  between  the  primary  and  the  secondary 

axial  distance  between  the  secondary  and  the  tertiary 

location  of  a point  on  the  focal  plane 

focal  length 


t P 

2(2T-R^) 

K 

— ^ + t 
2 ^1 


+ tr 


the  location  of  the  image  plane  is  a function  of  the  parameters  of  the  three-mir- 
ror optical  system.  The  distance  BF  between  the  tertiary  mirror  and  the  image 
plane  (see  Figure  7)  is  given  by 


BF  = 2^  (4-5) 

and  for  all  cases  the  focal  surface  must  be  placed  at  this  location,  or,  alter- 
nately the  radius  of  curvature  of  the  tertiary  should  be  calculated  using 
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2T*BF 
t ’ T + BF 


(4-6) 


so  that  the  final  image  coincides  v/ith  the  focal  plane. 

To  simplify  implementation  we  can  rewrite  the  previous  equations  in  matrix 
form.  Equation  4-1,  relating  the  line-of-sight  errors  to  the  translational  de- 
focus terms  becomes 


(4-7) 


where:  X,  Y,  Z - translational  defocus  terms  defined  in  Eq., (4-2)  to  (4-4) 


[A] 


0 1/F  0 

1/F  0 0 

0 0 1 


Equations  (4-2),  (4-3),  and  (4-4)  can  be  rewritten  in  matrix  form  as 


|y|  = [B]  {U^}  (4-8) 

where:  = vector  of  mirror  vertex  motions 

B = matrix  relating  to  translational  defocus  terms  (X,  Y,  Z) 

The  optical  system  used  in  ACOSS  model  #2  has  the  following  parameters: 

Rp  = 53.9  m 

= 30.8  m 

t^  = = 20.0  m 

Xt  = Y = 0.0  m 

I I 

F = 8.051  m 

T = 66.95  m 

The  radius  of  curvature  of  the  tertiary,  R^,  was  found  using  Eq.  (4-6)  and  the 
requirements  that  BF  = 20.0  m. 
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These  parameters  are  then  used  to  form  the  terms  in  matrices  A and  B. 

4.2  LOS  Error  Algorithm  Implementation 

The  motion  of  the  vertex  of  each  mirror  can  be  extrapolated  from  the  dis- 
placements of  the  support  points  since  all  mirrors  are  assumed  to  be  rigid.  The 
mirrors  are  supported  by  kinematic  mounts  so  that  there  is  no  elastic  deformation 
in  the  mirrors.  A displacement  transformation  matrix  which  relates  the  motion  of 
each  mirror  to  the  displacements  of  the  support  points  in  the  global  coordinate 
system  can  be  constructed.  The  relationship  is  of  the  form 


whfere:  = vector  of  mirror  displacements  and  rotations 

= vector  of  displacements  at  support  node  points 

C = displacement  transformation  matrix 

The  final  form  of  the  equations  relating  LOS  errors  to  displacements  at  the 
support  node  points  is 


/L0SX\ 

|lOSY\  = [A]  [B]  [C]  {IJ^>  (4-10) 


using  terms  that  have  been  previously  defined.  If  modal  superposition  is  used  in 
the  analysis,  Eq.  (4-10)  can  be  rewritten  as 


1 

' LOSX'j 
1 LOSY  > 

[ z ) 

[<()L0s^  {ri} 

where: 

{n} 

= modal  displacements 

1 — I 

o 
1 

-e- 

= [A]  [B]  [C]  [4.] 

This  matrix,  is  a simple  way  to  relate  LOS  errors  to  the  modal  displace- 

ments which  will  be  the  result  of  the  analysis  if  a modal  model  is  used. 

5 . 0 Conclusion 

Further  information  on  this  model  is  contained  in  CSDL  Report  C-5437. 


209 


This  Page  Intentionally  Left  Blank 


TRAVELLING  WAVE  EFFECTS  IN  LARGE 
SPACE  STRUCTURES 

Andy  von  Flotow 

Stanford  University 
Stanford,  CA  94305 


Abstract; 

This  paper  investigates  several  aspects  of  travelling  waves  in  Large  Space  Structures(LSS).  The 
dynamic  similarity  among  LSS’s,  electric  power  systems,  microwave  circuits  and  communications 
networks  is  noted.  The  existence  of  time  lags  between  actuation  and  response  is  illuminated  with  the 
aid  of  simple  examples,  and  their  prediction  is  demonstrated.  To  prevent  echoes,  communications 
lines  have  matched  terminations;  this  idea  is  applied  to  the  design  of  dampers  of  one  dimensional 
structures.  Periodic  structures  act  as  mechanical  band  pass  filters.  Implications  of  this  behaviour 
are  examined  on  a simple  example.  It  is  noted  that  the  implication  is  twofold;  continuum  models  of 
periodic  lattice  structures  may  err  considerably;  on  the  other  hand,  it  is  possible  to  design  favourable 
transmission  (and  resonance)  characteristics  into  the  structure. 


Introduction; 

Large  space  structures  have  been  proposed  with  dimensions  of  many  kilometers  (1),  (2).  More 
realistically,  for  the  near  future,  reflectors  and  antennas  with  dimensions  as  large  as  100  metres 
are  planned  (3).  Some  of  these  structures  will  be  composed  entirely  of  networks  of  long,  slender, 
essentially  one  dimensional  structural  members.  Others  will  have  their  flexibility  concentrated 
predominantly  in  one  dimensional  appendages  and  components.  These  components  will  be  flexible 
to  the  point  where  several  of  the  member’s  frequencies  of  free  vibration  will  be  within  the  frequency 
range  of  interest.  For  such  structures,  it  becomes  reasonable  to  visualize  the  response  to  a disturbance 
as  consisting  of  elastic  waves,  slowly  travelling  along  structural  members,  reflecting  and  transmitting 
at  nodes  and  discontinuities.  This  view  becomes  ever  more  valid  as  the  disturbance  is  concentrated 
in  space  and  time. 

The  limit  of  several  free  vibration  frequencies  of  individual  elements  being  within  the  frequency 
range  of  interest  has  been  reached  in  several  relevant  fields  within  electrical  engineering.  Microwave 
circuits  (4)  typically  operate  at  wavelengths  of  a few  millimetres  with  circuit  elements  as  large  as 
a few  centimetres.  Power  distribution  networks  (5),  have  electrical  wavelengths  of  3000  miles  (at 
60  Hz),  while  circuit  dimensions  of  the  same  scale  are  being  used.  Communications  lines,  with 
wavelengths  as  small  as  a few  miles  (corresponding  to  a 10  KHz  voice  transmission)  and  line  lengths 
of  many  hundreds  of  miles  are  an  extreme  example  (6).  In  these  fields,  a balanced  approach  to 
dynamic  analysis  is  used.  System  behaviour  is  investigated  with  both  travelling  wave  and  modal 
methods.  The  importance  of  travelling  wave  concepts  is  greatest  in  communication  line  analysis, 
where  the  wavelength  to  circuit  dimension  ratio  is  smallest. 

There  is  a relatively  clear  conceptual  relationship  between  travelling  wave  and  modal  analysis. 
Modes  of  free  vibration  are  derivable  from  a travelling  wave  analysis  by  invoking  the  phase  closure 
principle  (7).  A travelling  wave,  in  one  complete  circumnavigation  of  the  structure,  must  close  on 
itself  with  a total  phase  change  (due  to  travelled  distance  and  reflections)  of  2m!.  For  one-dimensional 
problems  the  phase  closure  principle  can  yield  the  modes  and  frequencies  of  free  vibration  with 
minimum  effort.  This  paper  concerns  itself  with  insights  gained  into  the  dynamic  behaviour  of 
one-dimensional  structural  elements  when  considered  from  a travelling  wave  perspective. 
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Travelling  Wave  Concepts 

Allideal  one-dimensional  structural  member  is  described  by  a system  of  partial  differential 
equations  together  with  appropriate  boundary  conditions.  The  system  of  PDE’s  may  have  arbitrarily 
high  order  in  space  and  time.  The  simplest  well  known  equation 


d'^U  _ d^U 
dt^  dx^ 


f(x,  t) 


(1) 


modelling  torsion  and  longitudinal  compression  of  solid  sections  and  transverse  motion  of  cables, 
is  second  order  in  space  and  time.  The  well  known  Bernoulli-Euler  beam  model  is  fourth  order  in 
space,  second  order  in  time,  while  the  Timoshenko  beam  model  is  fourth  order  in  both  space  and 
time  (8).  Continuum  models  for  lattice  beams  have  been  proposed  (9,  10)  which  are  large  as  eighth 
order  in  space  and  eighth  order  in  time. 

Assumption  of  a travelling  wave  solution  of  the  form 


U{x, 


(where  U is  a vector  of  deflection  variables,  K is  the  wave  number,  w the  circular  frequency)  and 
substitution  into  the  homogenous  system  of  partial  differential  equations  yields  a polynomial  matrix 
equation 

\Pij{i^,K)\Uo  ==  0 

The  condition  det  [Py]  = 0 yields  a polynomial  frequency  equation  in  w and  K,  and  the 
associated  eigenvectors  [/<,  give  participation  of  the  various  deformation  variables  in  the  travelling 


wave.  The  identity  u — Kcp  (where  Cp  is  phase  speed)  can  be  used  in  the  frequency  equation  above 
to  arrive  at  the  equivalent  dispersion  equation,  a polynomial  in  Cp  and  K. 

To  clarify  the  above  discussion,  and  to  introduce  the  phenomenon  of  dispersion,  the  following 
two  examples  are  given. 

Cable  on  Elastic  Foundation  (8): 


Governing PDE : 


d^u 

W 


(2) 


Frequency  equation  : 


-f  A — = 0 (/laurel  ) 


Dispersion  equation  : K^{\  — <^)  + A = 0 

Also  indicated  in  figure  1 is  the  use  of  the  phase-closure  principle  to  determine  modal  fre- 
quencies of  a finite  structure,  directly  from  the  frequency  vs  wave  number  curve. 

Timoshenko  Beam  (8): 


Governing  PDE’s  : 


dx^  ^ J 


= 0 
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where  7 is  relative  shear  flexibility,  y is  lateral  deflection,  ^ — gj  is  shear  deformation. 

Shear  deformation  effects  may  be  neglected  by  setting  7 = 0.  The  resulting  system  is  that 
studied  by  Rayleigh,  the  rotary  inertia  term  being  instrumental  in  producing  a more  realistic 
dispersion  curve  than  the  Bernoulli-Euler  equation  (figure  3b). 

Assumption  of 


leads  to  the  matrix  equation: 

-i^2+u;2-l/7  iKh 
iKj'i 

Thus  the  frequency  equation  is  (figure  2 ); 

A"*  — + 7)  + w^(w^7  — 1)  = 0 


(4) 


FREOmiNCY  VS  WAV!:  NlfBER  FOR  A CABLE  ai  AN 
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and  the  dispersion  equation  is  (figure  3b); 


K^{\  - c^(l  + 7)  + 4'f)  = ^ 

The  comparison  in  figures  2 and  3b  with  the  Bernoulli-Euler  beam  model  should  be  noted. 
The  Bernoulli-Euler  beam  model  predicts  infinite  phase  speeds  for  infinitesimal  wave  numbers.  Since 
the  phase  speeds  determine  the  model  frequencies  (from  the  phase-closure  principle),  the  higher 
frequencies  tend  to  be  over-predicted.  Timoshenko  models  of  typical  built-up  beams  (11)  have  shown 
them  to  have  non-dimensional  shear  stiffness  i much  less  than  unity.  Thus,  for  these  structures,  the 
shear  deformation  included  in  the  Timoshenko  beam  model  becomes  important  surprisingly  early. 


Dispersion,  Velocity  DeflnitionB 

A medium  for  which  the  phase  speed  Cp  = w/K"  is  dependent  on  frequency  (or  wavelength) 
is  called  dispersive.  For  such  a medium  the  concept  of  signal  propagation  may  become  poorly 
defined.  The  signal  distorts  as  it  propagates,  and  may  become  unidentifiable.  For  these  media, 
other  velocities  may  be  meaningfully  defined.  Group,  signal,  and  energy  transmission  velocities 
have  all  been  studied  in  the  literature  (12).  Group  velocity,  also  a function  of  frequency,  is  the 
velocity  with  which  a group  of  harmonic  disturbances  of  almost  identical  frequency  will  propagate. 
Signal  velocity  is  loosely  defined  as  the  speed  at  which  the  first  detectable  portion  of  a disturbance 
propagates.  Energy  transmission  velocity,  also  loosely  defined,  is  the  average  speed  at  which  the 
bulk  of  the  energy  in  a disturbance  is  propagated.  For  a non-dispersive  medium,  all  velocities  are 
equal  to  the  phase  velocity,  and  independent  of  frequency. 

It  has  been  shown  (12)  that  the  signal  and  energy  transmission  velocities  are  equal  to  the 
group  velocity  when  the  frequency  equation  is  linear  in  w and  K.  Moreover,  when  this  restriction 
is  almost  satisfied,  or  when  the  disturbance  has  a frequency  decomposition  primarily  restricted  to 
the  linear  portions  of  the  frequency  equation,  the  group  velocity  determines  speed  of  propagation  of 
disturbances.  Only  in  case  of  anomalous  dispersion  characteristics  must  other  speeds  be  considered. 
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As  has  been  stated,  the  group  velocity  is  the  speed  of  propagation  of  a group  of  harmonic 
waves  of  only  slightly  varying  frequencies.  It  is  dehned  as  cq  = and  can  be  calculated  from  the 
frequency  equation.  The  group  velocity  of  the  two  examples  introduced  above  is  graphed  in  figures 
3a  and  3b. 

Exact  solutions  exist  for  some  loading  conditions  in  both  these  examples.  The  impulse  response 
of  the  cable  on  an  elastic  foundation  is  a travelling,  distorting  Bessel  function,  whose  discontinuous 
front  travels  with  the  maximum  group  velocity  (8),  c = 1 (refer  to  fig.  3a).  The  transient  response 
of  the  Timoshenko  beam  model  admits  some  closed-form  solutions  (13).  Two  limiting  velocities  are 
identified;  discontinuities  in  shear  force  and  lateral  velocity  propagate  with  nondimensional  velocity 
c = 1,  while  discontinuities  in  bending  moment  and  angular  velocity  travel  with  non-dimensional 
velocity  c — 1/^7  (refer  to  fig  3b).  Although  no  proof  is  known,  it  is  proposed  to  generalize  these 
results,  and  to  make  the  following  claim  for  media  with  smooth,  quasi-linear  frequency-wave  number 
relationships;  disturbances  travel  with  a velocity  no  greater  than  the  maximum  group  velocity. 

Time  Lags; 

An  effect  clearly  illuminated  by  a travelling  wave  approach  is  the  existence  of  finite  time  lags 
between  actuation  and  response  when  actuator  and  sensor  are  not  co-located.  Actually,  due  to 
multiple  wave  paths  and  reflection,  many  time  lags  will  be  present,  however,  experience  indicates 
that  in  most  real,  dissipative  and  dispersive  structures,  only  the  first  lag  will  be  clearly  observable. 
Since  the  response  time  lags  depend  on  the  structure’s  dimension  and  on  group  velocity,  while 
modal  periods  depend  on  dimension,  boundary  condition,  and  phase  velocity,  one  can  expect  lags 
independent  of  modal  periods,  possibly  much  larger  than  the  period  of  the  lowest  mode. 

Recent  experiments  with  flexible  structures  have  demonstrated  significant  lags,  on  the  order 
of  modal  periods  of  the  second  and  third  modes  (2,  14).  A flexible  built-up  beam  at  Stanford 

University  (1.1  metre  long)  exhibited  lags  in  tip  deflection  response  to  root  torques  of  approximately 
100  milliseconds  (14).  This  compares  with  the  period  of  the  third  flexible  mode,  measured  as  110 
milliseconds.  Figure  4a  gives  a rough  description  of  the  beam  and  relevant  experimental  results. 

Tiw;  1A5  EXimE 


(STANFCRD  FLEXIBLE  MANIPULATCR) 


FiaKE  % 
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Figure  4b  shows  the  dispersion  curve  of  a Timoshenko  model  of  the  Stanford  beam.  The  four  beam 
parameters  were  estimated  by  smearing  stiffness,  mass  and  rotary  inertia,  then  adjusted  to  predict 
the  first  two  experimentally  measured  clamped-free  modes  of  vibration. 

The  applied  root  torque  enters  the  beam  as  a bending  moment,  and  the  disturbance  can  be 
expected  to  travel  (13)  with  non-dimensional  limit  velocity  c = 1/^7,  or  from  figured,  c — 8.5 
metre/sec.  Thus,  the  calculated  lag  is  129  milliseconds. 

Matched  Terminations; 

An  idea  which  originates  in  the  field  of  communications  is  that  of  a matched  termination  for  a 
transmission  line  (6).  Such  a termination  prevents  reflection  (echoes)  by  imitating  the  behaviour  of  a 
semi-infinite  continuation  of  the  transmission  line.  The  resulting  response  lacks  modes  of  vibration 
entirely,  and  the  line’s  transfer  function  may  be  simplified  to  that  of  a single  integrator. 

The  idea  of  matched  terminations  can  be  extended  to  the  design  of  active  or  passive  dampers  at 
the  boundaries  of  structural  elements.  For  the  one-dimensional  structures  considered  in  this  paper, 
the  matched  termination  is  expressed  as  a set  of  transfer  functions  relating  tip  displacements  and 
applied  forces.  The  practical  implementation  of  the  required  transfer  functions  may  be  difficult; 
however  the  idea  provides  a basis  for  passive  damper  design.  Two  examples  involving  simple 
structural  models  follows. 

Matched  Termination  Example  1; 

Simple  wave  equation;  model  of  torsion  and  longitudinal  compression  of  solid  section,  transverse 
motion  of  tensioned  cable; 


Incident 


Reflected 


U=  R 


IKx+st 


|— » U(0,s) 


LET  H(s) 


F(s) 

U(O.s) 


PDE: 


d^u  _ p d^u 
5^2  “ eW 


BC  ; 


EA 


du(0,  S) 
dx 


= F(5) 


where  p is  the  material  mass  density,  E is  the  modulus  of  material,  and  A is  the  cross-sectional 

area. 

Application  of  boundary  condition  yields  for  the  reflection  coefficient 


H{S)AiKEA 
iKEA  - H(S) 

but  the  PDE  imposes  the  frequency  equation; 


iK  = yJJfE  S 


^_H(S)  + A^S 
Ay/^S  - H(S) 

for  H(S)  = —As/EpS  (a  daspot)  R — 0 ( no  reflection  occurs). 
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Thus,  a properly  sized  dashpot  yields  a matched  termination  for  this  class  of  problems.  To 
study  the  effect  of  mismatched  terminations  on  the  transfer  function  of  a finite  structure,  the  state- 
vector  transition  matrix  method  of  Pestel  (15)  is  employed.  This  method  proves  valuable  in  the 
analysis  of  periodic  structures  and  is  introduced  in  this  example.  With  reference  to  figure  5,  the 
deformation  of  a rod  (of  length  i,  area  A,  modulus  E)  in  longitudinal  compression  is  described  by 
state  vector  [U,NY  where  U is  non-dimensional  displacement  Uf£,N  is  non-dimensional  internal 
force  N/EA. 

The  state  vectors  at  two  points  in  a structure  are  related  by  the  transition  matrix 
For  the  rod,  the  transition  matrix  is  given  by 


cosh  s’ 
^ slnh  s 


where  s •= 


Note  that  the  state  vector  at  points  B and  C may  be  related  by  another  transition  matrix  (15): 


So,  multiplication  of  matrices  yields  the  global  transition  matrix: 


, - . s slnh  s 

slnh  s 1 cosh  s 

cosn  s + . ^ . 

Y + 8 A 

Y + s A 

-2  , . _ . Is  slnh 

s y(  cosh  6 + 

Y + S 

A 

_2  , slnh  8 . cosh  s v 
8 p(  - + ) 

Y + s'  A 

+ s’  slnh  's 

+ cosh  s 

where  n — mfpAl  (mass  fraction) 

7 = KfljEA  (stiffness  ratio) 

A = dfAy/Ep  (non-dimensional  damping  constant) 

Various  transfer  functions  can  be  extracted  from  the  above  matrix  equation.  In  the  present 
case,  Ne  ==  0,  so  the  second  row  gives  the  transfer  function 
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N,(») 


For  the  matched  termination  calculated  above,  (7  = 0,  A ==  1,/r  — » 00),  this  becomes 


Ua{S)  _ 1 

a single  integrator,  also  (with  a redefinition  of  the  length  non-dimension alisation)  the  transfer 
function  of  a semi-infinite  rod. 

The  poles  and  zeroes  of  the  transfer  function  (1)  are  shown  in  figures  6.  Figures  6a,  b show  the 
effect  of  damper  mismatch,  with  an  inertial  reference  point  available  (infinite  mass  ratio).  Figure  6c 
uses  more  realistic  mass  ratios  of  1%  and  5%  and  shows  the  root  locus  versus  damper  strength.  The 
optimum  damper  strength  is  seen  to  be  a function  of  mass  ratio  and  mode  to  be  damped.  Modal 
damping  ratios  as  large  as  1%  can  be  obtained  with  a 1%  mass  penalty.  The  picture  could  be 
complicated  by  inclusion  of  a spring  7 #=  0,  however,  such  a search  for  an  optimum  tuned  damper 
has  been  described  elsewhere  (16,17). 


Matched  Termination  Example  2; 


Bernoulli-Euler  beam  model: 


PDEj  + ^ =0 

ax'*  3t2 

EC's;  (at  x-0)  E I - — ^ - f 
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Note  that  the  PDE  admits  two  types  of  time-harmonic  solutions,  one  a travelling  unattenuated 
wave,  the  other  a near  field  effect,  attenuated  with  distance.  Assume  that  an  external  moment  M 
and  force  f can  be  applied  in  response  to  end  defiection  and  slope 

dv 

M(w)  = Mo(w)j/(0,w)-|-  Mi(w)~(0,w) 

dv 

/(w)=  /o(w)j/(0>w)-H/j(w)^(0,w) 

This  assumption  is  in  the  interest  of  practicality.  If  moments  proportional  to  curvature  and 
forces  proportional  to  could  be  applied,  it  would  become  easy  to  imitate  the  behaviour  of  a 

semi-infinite  continuation.  Using  the  assumed  solution 

y(x,t)  = + + Ce-^*e-"‘ 

and  application  of  the  above  boundary  line  yields 


+ iK«j  + E I - Mj  k - E I / b\  + ikMj  - E I 

fo  + IK  fj  + iE  I - fjK  + E I \C/  ~ -fg  + 1 fj  + E I 


Combinations  of  boundary  transfer  functions  Mo,Mi,/o,/i,  which  yield  B = C = 0 are 
perfectly  matched  boundary  conditions.  If  C = 0,  reflection  occurs,  however  the  reflected  signal 
does  not  propagate.  Figure  7 presents  some  candidate  combinations.  All  choices  involve  springs 
or  dashpots  which  are  dependent  on  disturbance  frequency.  A square  root  frequency  dependence 
is  approximated  by  many  types  of  rubber  (17),  however,  an  dependence  is  likely  to  present 
difficulties.  A practical  implementation  of  these  ideas  would  likely  use  the  above  matched  termina- 
tions only  as  guidelines  in  a more  comprehensive  damper  design. 


SPRINS) 


c 0 0 c * 0 
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Periodic  Structures 


A periodic  one-dimensional  wave  guide  exhibits  anomalous  dispersion  characteristics  (18).  The 
frequency-wave  number  relationship  has  a banded  structure,  with  passing  and  stopping  bands.  There 
will  be  as  many  passing  bands  as  there  are  degrees  of  freedom  in  the  elementary  cell,  disturbances 
at  frequencies  outside  these  passing  bands  are  spacially  attenuated.  A continuous  wave  guide  with 
periodically  attached  resonators  will  exhibit  two  types  of  structure;  there  will  be  stopping  bands  due 
to  the  periodicity,  and  stopping  bands  due  to  the  poles  and  zeroes  of  the  attached  resonant  systems. 

These  effects  can  make  continuum  modelling  of  periodic  lattice  beams  difficult.  Figure  8, 
taken  from  a paper  by  Anderson,  (19)  compares  modal  frequencies  for  a pinned-pinned  lattice 
beam  calculated  in  two  ways.  The  equivalent  continuum  beam  model,  adequate  for  description  of 
static  deformation,  errs  seriously  (by  300%  in  mode  4),  in  predicting  modal  frequencies.  Anderson 
discovered  that  “the  reason  for  this  result  is  that  the  clamped  end  frequency  of  the  diagonal  members 
is  12.3  Hz  which  becomes  an  upper  limit  for  the  lowest  frequency  for  each  n number.” 


The  essential  features  of  wave  propagation  and  resonances  in  a continuous-periodic  structure 
are  demonstrated  by  the  following  example.  The  system  to  be  studied  consists  of  a uniform  rod  in 
longitudinal  compression. 
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It  is  loaded  by  identical  resonant  systems  periodically  attached.  Using  the  state  vector  representation 
and  symbols  of  a previous  example,  the  transition  matrix  corresponding  to  an  elementary  cell  is 


lu] 

1+1 

cosh  s + H(s-)  i) 

® 2s 

i slnh  S'  + (1  + cosh  s)  cosh  s + H(i)  ^ 


where  s *=  s 

U = U/£ 

h = h£/ea 
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Symbolically,  the  approach  to  obtaining  the  poles  and  zeroes  of  a finite  periodic  structure  would 
consist  of  the  same  steps  used  in  the  previous  example.  The  global  transition  matrix  is  obtained 
by  multiplication  of  local  matrices.  The  boundary  conditions  are  used  to  extract  the  frequency 
determinant  or  transfer  function  from  the  global  matrix.  The  transfer  function  is  searched  for  its 
poles  and  zeroes.  This  approach  leads  to  numerical  difficulties,  and  a more  intuitive  method  using 
the  phase-closure  principle  is  described. 

Wave  Propagation  along  Second  Order  Periodic-Continuous  Structures: 

A second  order  continuous  structure  with  periodically  attached  resonant  systems  will  have  a 
2x2  elementary  transition  matrix 


^21  ^22 


A single  wave  that  propagates  along  this  infinite  structure  is  characterized  by  the  fact  that  the 
state  vector  is  multiplied  by  a complex  factor  ^ as  the  wave  passes  through  the  cell 


ru\ 

- 

= t 

i+1 

Thus,  the  transmission  coefficient  ^ may  be  determined  as  the  eigenvalues  of  the  local  transition 
matrix 

— (Til  + T2z)(  -1-1  = 0 

(where  the  energy  conservation  property,  det  [T]  = 1,  valid  for  conservative  systems,  has  been  used). 
This  quadratic  has  the  property  that  its  two  roots  satisfy  ^1^2  = 1-  Thus,  if  = e~'^  = 
then  ^2  = Here  a is  the  attenuation  coefficient  per  cell,  ^ the  phase  change.  Unattenuated 

propagation  occurs  if  a = 0 i.e.  iTn  -t- T22I  < 2.  When  spacial  attenuation  occurs,  resonances  in 
finite  structures  are  prevented. 

In  the  second  order  case  described  above,  it  is  interesting  to  note  the  exact  analogy  with  Floquet 
theory  and  Hill’s  equation  (20). 

The  procedure  outlined  above  was  used  to  calculate  the  frequency-wave  number  relationship 
for  the  example  structure.  Point  transfer  functions  H(S)  representing  rigidly  attached  masses  and 
elastically  attached  masses  were  considered.  Results  are  graphed  in  figures  9.  These  figures  display 
the  banded  frequency  distribution.  The  first  branch  of  the  curve  in  figure  9b  should  be  compared 
with  Anderson’s  results  in  figure  8.  Anderson  also  mentions  the  existence  of  the  higher  branches,  but 
does  not  show  them.  The  existence  of  this  banded  resonant  structure  is  a hindrance  to  continuum 
modelling  of  lattice  beams.  However,  it  also  provides  opportunities  to  the  designer.  With  prescribed 
tuning,  the  stopping  bands  may  be  placed  where  desired.  Resonance-free  frequency  ranges  may  be 
designed  into  structural  members  and  used  for  vibration  isolation  or  control  spillover  amelioration. 
In  this  model  the  width  of  the  first  stopping  band  depends  on  the  mass  ratio  and  on  the  tuning. 

Since  the  elastically  attached  mass  is  a simplified  representation  of  internal  lattice  members, 
it  is  difficult  to  speak  of  a weight  penalty.  However,  in  this  model,  a mass  penalty  of  5%  (figure  9c) 
yielded  a resonant-free  frequency  range  of  relative  bandwidth  15%.  Inclusion  of  light  damping  in 
the  spring-mass  system  would  potentially  double  the  effective  resonant-free  range. 
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Summary; 

This  paper  has  examined  various  aspects  of  travelling  wave  effects  in  one-dimensional  struc- 
tures. The  existence  of  time  lags  was  indicated,  and  a simple  method  of  their  prediction  was 
demonstrated.  The  possibility  of  passive  damper  design  based  on  the  matched  terminations  of 
communications  engineering  was  explored  with  the  help  of  two  examples.  Insight  was  gained  into 
the  continuum  modelling  of  periodic  lattice  beams  and  the  possibility  of  custom-designing  desirable 
transmission-resonance  characteristics  was  pointed  out. 
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ABSTRACT 

In  this  paper,  a systematic  way  to  choose  control  design  parameters  and  to 
evaluate  performance  for  large  space  antennas  is  presented.  The  structural 
dynamics  and  control  properties  for  a Hoop  and  Column  Antenna  and  a Wrap-Rib 
Antenna  are  characterized.  Some  results  of  the  effects  of  model  parameter 
uncertainties  to  the  stability,  surface  accuracy,  and  pointing  errors  are 
presented.  Critical  dynamics  and  control  problems  for  these  antenna 
configurations  are  identified  and  potential  solutions  are  discussed. 

I.  INTRODUCTION 

Large  space  antennas  and  other  large  space  structures  will  play  an  impor- 
tant role  in  the  coming  decades  as  commercial  applications  of  space  become 
feasible  - especially  in  the  area  of  communications.  Structures  from  10  m to 
120  m and  larger  have  been  considered  by  NASA  and  other  government  agencies  for 
future  missions  (Reference  1).  As  the  structural  size  and  mass  distribution 
change  drastically  from  that  of  conventional  spacecraft,  many  difficult  control 
problems  arise.  The  basic  problem,  however,  comes  in  modeling  highly  flexible 
structures.  Structures  of  this  type  are  known  to  have  a large  number  of  packed 
modes  at  very  low  frequencies;  mode  shapes  and  frequencies  can  not  be  accurately 
predicted  or  measured  even  for  a small  number  of  modes  at  preflight  time. 

This  means  that  there  are  always  model  uncertainties.  Model  uncertainties  in 
the  control  loop  can  cause  serious  consequences  including  the  possibility  of 
making  the  system  unstable.  Model  order  is  another  problem  with  great  impact 
in  control  design.  Using  today's  inflight  computer  capability,  one  can  only 
expect  to  have  a modest  low  order  controller  - which  often  means  model  trunca- 
tion and  will  cause  further  performance  deterioration.  Dynamics  and  control 
problems  for  specific  configurations  must  be  characterized  and  evaluated  in 
terms  of  Incomplete  knowledge  of  the  system  dynamics  so  that  the  required  per- 
formance can  be  ensured  and  risk  reduced. 

In  this  paper,  the  structural  dynamics  and  the  control  properties  for  a 
64-meter  diameter  center  fed  antenna  and  a 55-meter  offset  fed  antenna  are 
Investigated.  Some  interesting  results  are  presented  on  the  effects  of  model 
parameter  uncertainties  to  the  system  stability,  surface  accuracy,  and  pointing 
accuracy.  Critical  control  problems  are  identified  and  potential  solutions 
are  recommended.  In  Section  II  the  antenna  configurations  and  the  structural 
dynamic  properties  are  briefly  described.  The  control  design,  disturbance 
assessment,  and  construction  of  weighting  matrices  are  summarized  in  Section 
III.  Finally,  numerical  results  and  conclusions  are  given  in  Sections  IV  and 
V,  respectively. 
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II.  ANTENNA  CONFIGURATIONS  AND  STRUCTURAL  DYNAMIC  PROPERTIES 


A.  Antenna  Configurations 

Several  antenna  concepts  are  under  development  (References  1,2,3).  However, 
there  are  two  concepts  that  have  drawn  particular  attention  in  recent  years; 
they  are  the  structurally  balanced  center  fed  quad  aperture  hoop  and  column 
antenna  and  the  offset  fed  wrap-rib  antenna.  The  hoop  and  column  antenna  chosen 
for  analysis  is  a pretensioned  mesh  deployable  structure.  Figure  1 shows 
the  deployed  100  mD  antenna  and  the  major  components  are  depicted  in  Figure  2. 

The  basic  structural  components  are  the  feed  assemblies,  the  mast,  the  hoop, 
and  the  reflector  mesh  surface.  The  mast  and  the  support  cables  suspend  the 
hoop  and  together  the  assembly  provides  the  stiffness  of  the  structure.  The 
mesh  is  suspended  by  the  hoop  and  the  mast  and  the  surface  shape  is  controlled 
by  the  shape  cables  through  a secondary  draw  surface  and  tie  cords.  The  cables 
are  stranded  quartz  cords.  The  hoop  consists  of  48  hollow  sections  made  of 
graphite  fiber.  The  adjacent  sections  are  joined  together  through  hinges.  The 
reflector  is  made  of  gold  plated  poly-wire  mesh.  The  four  circular  surface 
areas  are  separately  illuminated  by  the  feed  elements  that  form  the  offset 
feed  quad-aperture  arrangement. 


Figure  1.  Deployed  100  mD  Hoop  and  Column  Antenna. 
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Figure  2.  Side  View  of  100  mD  Hoop  and  Column  Antenna. 

The  wrap-rib  antenna  consists  of  a 55-meter  diameter  mesh  reflector,  a 
feed  array  mounted  on  a spacecraft  bus,  and  a long  L-shaped  boom  connecting 
the  reflector  and  the  bus.  A typical  configuration  is  shown  in  Figure  3. 

The  reflector  consists  of  a number  of  radial  ribs  which  are  cantilevered 
from  a central  hub  structure.  For  parabolic  or  other  curved  shape  reflectors, 
the  ribs  are  formed  in  accordance  with  the  required  surface  shape,  and  the  mesh 
gores  are  attached  between  the  ribs. 

B.  Structural  Dynamic  Properties 

The  total  weight  of  the  hoop  and  column  antenna  system  is  2790  kg  of  which 
the  feed  assembly  weighs  approximately  30%,  the  hoop  and  the  mast,  20%,  and  the 
solar  panels  and  the  spacecraft  weigh  50%.  Since  mass  distribution  affects 
modal  frequencies,  it  is  appropriate  to  point  out  that  based  on  a subsequent 
study  of  a 122  mD  LMSS  (Land  Mobile  Satellite  Service)  antenna  a much  heavier 
(approximately  50%)  mass  concentration  at  the  feed  area  is  more  realistic. 

To  assess  performance,  a finite  element  model  of  a 64  mD  hoop  and  column 
antenna  (Reference  4)  has  been  adopted.  The  reflector  and  the  hoop  were  modeled 
by  a two-for-one  model  with  24  gore  and  hoop  sections  and  120  grid  points. 

The  feeds,  the  solar  panels,  the  mast,  etc.,  were  represented  by  86  grid  points. 
Bar  elements  were  used  to  model  the  mast  and  the  feed  and  solar  panels;  and  rod 
elements  for  the  hoop  joints,  hoop  support  and  surface  shaping  cables. 

The  finite  element  analysis  revealed  that  the  lowest  mode  is  the  first 
torsional  mode  which  has  a frequency  of  0.10  Hz;  the  next  higher  modes  are  the 


227 


TOTAL  MASS 


%95  LBS 


two  orthogonal  mast  bending  modes  with  frequencies  0,43  Hzj  the  next  two  modes 
are  the  second  and  third  mast  torsion  modes  which  are  followed  by  two  orthogonal 
second  mast  and  dish  bending  modes.  Figure  4 shows  the  modal  and  mass  proper- 
ties. It  is  interesting  to  note  that  the  antenna  has  relatively  high  bending 
stiffness  and  weak  torsional  stiffness.  The  high  radial  separation  of  the  hoop 
and  the  mast,  the  large  hoop  moment  of  Inertia,  and  the  small  moment  arm  of 
the  hoop  cables  at  the  mast  make  the  mast  very  loosely  coupled  to  the  hoop  so 
that  the  mast  alone  dominates  the  torsional  modes. 

The  total  weight  of  the  wrap-rib  antenna  system  is  4,407  kg  (9695  lb),  80% 
of  which  is  concentrated  at  the  feed/bus  area.  The  remaining  20%  is  attributed 
to  the  reflector  and  the  hub.  Moments  of  inertia  are  3.95x10^,  3.58x10^, 

.502x10^  kg -m2.  The  largest  cross  product  of  inertia  is  .976x10^  kg -m2,  which 
is  due  to  the  Inherent  Imbalance  of  the  offset  feed  configuration.  The  axis  of 
least  moment -of -inertia  is  approximately  along  an  imaginary  line  connecting  the 
bus  and  the  reflector  hub.  This  results  in  a 16°  offset  between  the  least 
moment -of -inertia  axis  and  the  local  vertical. 

Mode  shapes  and  mode  frequencies  of  the  wrap-rib  antenna  system  were 
obtained  in  two  steps.  First,  reflector  modes  and  offset  boom  modes  were 
obtained  from  finite  element  methods.  Second,  the  modes  of  these  two  substruc- 
tures were  then  combined  through  the  process  of  modal  synthesis  on  the  assumption 
that  the  interface  at  the  hub  structure  is  rigid.  Figure  5 provides  the  first 
seven  flexible  mode  frequencies  and  mode  shape  description  of  the  wrap-rib 
antenna  system. 
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• MODAL  FREQUENCIES 


NO. 

FREQ,  HZ 

DESCRIPTIONS 

7 

0.10 

1ST  MAST  TORSION 

8 

0.43 

1ST  MAST  ROLL  BENDING 

9 

0.43 

1ST  MAST  PITCH  BENDING 

10 

0.58 

2ND  MAST  TORSION 

11 

1.07 

3RD  MAST  TORSION 

12 

1.83 

2ND  MAST/DISH  ROLL  BDG 

13 

1.90 

2ND  MAST/  DISH  PITCH  BDG 

14 

3.20 

DISH  WARPING 

15 

3.28 

DISH  WARPING 

16 

3.36 

DISH  WARP  MAST  BENDING 

17 

3.37 

DISH  WARP  MAST  BENDING 

18 

4.43 

DISH  WARP  MAST  BENDING 

• 64  mD  ANTENNA 

• QUAD-APERTURE 


• MASS  PROPERTY 

. MASS:  2790  Kg 
• MOMENT  OF  INERTIA 

1 .42  X 10*  Kg-m^ 


• MAX  DISTURBANCE  TORQUES 

• GRAVITY  GRADIENT  1 .89  x 10'^  Nm 
. t.  'in  ..  ia“4 


• GYROSCOPE 

• SOLAR  PRESSURE 


6.30  X 10 


1.42x10  Kg-m  , SOLAR  PRESSURE  6.23x10'" 

1 .42  X 10* 

2.73  X 10* 

BALANCED  CONFIGURATION 

Figure  4.  Structural  Properties  for  the  64  mD  Hoop  and  Column  Antenna 


FREQ  (Hz) 
0.0872 
0.1473 
0.1965 
0.2062 
0.2201 
0.2906 
0.6644 


DESCRIPTION 
SHORT  BOOM  TORS  ION 
DISH  TORSION,  LONG  BOOM  TWIST 
DISH  TORSION,  LONG  BOOM  BENDING 
DISH  BENDING 

DISH  BENDING,  LONG  BOOM  BENDING 
DISH  TORSION 

DISH  ROTATION,  LONG  BOOM  BENDING 


FEED/BUS 


Figure  5.  Wrap-Rib  Antenna  Mode  Frequencies/Shapes 
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III.  ATTITUDE  AND  STRUCTURAL  CONTROL  SYSTEMS 


A.  Equations  of  Motion 

For  small  perturbations  about  the  equilibrium,  the  dynamics  of  this  system 
may  be  described  by  the  finite  element  method.  For  the  hoop  and  column  system, 
let  q be  the  18-modal  amplitude  vector,  3>  the  1236xl8-eigenvector  matrix,  and  f 
the  vector  of  the  applied  forces.  The  dynamic  equations  are,  in  component  form 


k = 1 , ...  ,6 

(la) 

+ + “k^“k  ■ ♦k’^f  • 

k = 7,. ..,18 

(lb) 

where  the  modal  frequencies,  may  be  obtained  from  Figure  4 and  the  estimated 
values  for  the  damping  ratios,  are  . 01  for  k = 7,..., 10  and  .02  for 
k = 11,... ,18. 

Due  to  the  existence  of  numerical  sensitivity  problems  the  rigid  modes 
were  not  used  and  instead  Euler's  equations  were  used  to  describe  the  rigid 
body  motion.  Referring  to  Figure  4,  let  X,  Y,  Z be  the  body-fixed  coordinates. 
Consider  a nadir  pointing  configuration  with  the  satellite  moving  in  a circular 
synchronous  orbit.  The  Euler's  equations  for  the  rigid  body  rotation  are 


I d + 4to 
x^  o 

I e + 3oj 
y o 

I m ^ 

z o 


^(I  -I  )(j)  + 0)  (I  -I  -I  = T 
'y  z'  ^ o y X z'  ^ 

^(1  -I  )e  = T 

X z 

(I  -I  + 0)  (I  -I  +I  = T 
y X o x y Z 


dx 


dy 


dz 


+ T 


cx 


+ T 

cy 


+ T 

cz 


(2a) 

(2b) 

(2c) 


where  Ij,. , I , I^ , % , , 6,  and  ip  are  the  principal  moments  of  inertia,  the 

orbital  rat^,  and  the  roll,  pitch,  and  yaw  angles,  respectively.  T^j 

Tdz,  Tex,  Tcy,  and  Tez  are  the  disturbance  and  control  torques,  respectively . 
Note  that  since  the  gravity  gradient  torques  and  the  gyroscopic  torques  are 
already  included  in  the  left-hand  side  of  Eq . (2),  they  are  excluded  from  T(j, 
The  angles  <j> , 0,  ^ and  their  derivatives  in  Eq . (2)  are  assumed  small  and  they 
are  related  to  inertial  angular  rates  by  the  following  approximate  relations. 
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o 
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The  equations  of  motion  of  the  wrap-rib  antenna  system  were  developed  in 
Reference  5.  The  approach  there  was  based  on  LaGrange's  formulation.  The  set 
of  equations  developed  reproduces  mode  shapes  and  frequencies  very  close  to  those 
generated  using  finite  element  methods  (Figure  5).  However,  the  approach  in 
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Reference  5 permits  convenient  and  inexpensive  model  parameter  changes. 

The  equations  of  motion  have  the  following  general  form: 

Mx  H-  Kx  = F (4) 

where  x consists  of  3 rigid-body  rotations,  and  boom  and  reflector  flexible 
modes.  M and  K are  the  mass  and  the  stiffness  matrices,  respectively.  And 
F represents  all  non -conservative  forces  on  the  antenna  system.  In  this  study, 

3 boom  modes  and  6 reflector  modes  were  selected  for  analysis,  and  therefore 
Eq.  (4)  is  a 12  X 12  second  order  system. 

To  consider  natural  damping  of  the  system,  the  following  equation  was 
adopted : 

Mx  H-  Dx  + Kx  = F (5) 

T 

where  D = $ [2Zfi]$;  $ and  0 are  the  eigenfunctions  and  eigenvalues  of  Eq,  (4). 

And  Z repres€'.nts  the  diagonal  matrix  of  damping  ratios: 

Z = diag  (?^)  , (6) 

where,  in  this  study,  = 0 for  i = 1,2,3,  and  = .005  for  all  other  i. 

B . Control  Hardware  Placement 

The  beam  pointing  accuracy  is  determined  by  the  orientations  of  the  feed 
and  the  dish  and  their  relative  motions.  Sensor  and  actuator  placement  is 
dictated  by  the  observability  and  controllability  properties  and  the  structural 
constraints.  By  examining  the  eigenvectors  of  the  elastic  modes  at  various 
potential  locations,  it  can  be  seen  that  sensors  and  actuators  placed  at  the 
feed  assembly  and  the  spacecraft  bus  can  most  effectively  control  the  attitude 
and  the  important  elastic  modes.  Figure  6 shows  the  two-site  3-axis  attitude 
and  structural  control  system  for  the  hoop  and  column  configuration  considered 
in  this  paper.  Basic  control  devices  are  the  inertial  sensors  and  momentum 
wheels.  Reflector  shape  control  can  be  achieved  with  the  addition  of  optical 
sensors  such  as  SHAPES  (Reference  6)  and  shape-cable  actuators.  Hoop  motion  can 
be  controlled  with  a SHAPES  sensor  and  thrusters  on  the  hoop  sections. 

Three  options  in  control  hardware  placement  have  been  considered  for  the 
wrap-rib  ante-.nna.  Option  1 has  all  attitude  sensors  and  actuators  lumped  to- 
gether and  mounted  on  the  bus.  This  concept  represents  current  attitude 
controllers  for  3-axis  spacecraft  stabilization. 

Option  2 represents  a departure  from  option  1 in  that  it  calls  for  an 
additional  optical  sensor  mounted  on  the  bus  to  perform  multi-point  distributed 
sensing  of  the  relfector  as  shown  in  Figure  7.  The  purpose  of  having  this 
optical  sensor  is  to  obtain  distortion  and  vibration  information  of  the  reflector 
and  boom  directly  by  measurements. 


231 


Option  3 is  option  2 plus  additional  control  devices  at  the  reflector  hub. 
It  represents  a spatially  distributed  control  scheme  for  the  antenna  such  that 
the  distortions  of  the  short  boom  can  also  be  controlled  effectively  with 
actuators  located  at  the  reflector  hub. 


shape/vibration 
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Figure  6.  Two-Site  Control  System  for  Hoop  and  Column  Antenna 


Figure  7.  Wrap-Rib  Control  for  Option  3. 

C.  Characterization  of  Disturbances 

The  disturbances  discussed  here  are  limited  to  the  gravity  gradient  and 
gyroscopic  torques  due  to  nonprincipal  axis  pointing,  solar  pressure  torques, 
and  torques  caused  by  thruster  firings  for  wheel  dumping  and  station  keeping. 
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Consider  the  LMSS  antenna  applications.  With  the  current  hoop  and  column 
antenna  design,  the  body  z-axis  must  tilt  6°  from  nadir  in  order  to  provide 
the  necessary  communication  coverage.  The  orbital  configuration  is  shown  in 
Figure  8a.  A gravity  gradient  torque  and  gyroscopic  torque  of  -2,52x10“^  N~m 
is  created  about  the  x-axis . Since  this  torque  is  noncyclic,  a periodic  wheel 
dumping  at  the  rate  of  217.73  N-m-s  for  every  24  hours  is  required.  The  solar 
pressure  torques,  on  the  contrary,  are  cyclic.  With  the  availability  of  a 
detailed  configuration  for  a 122  mD  antenna,  the  solar  pressure  torques  for  the 
latter  have  been  computed  for  arbitrary  sun  inclination  and  orbital  position. 
The  solar  pressure  torques  for  the  64  mD  antenna  are  obtained  by  scaling  and 
considering  the  shift  of  the  center  of  mass.  The  projected  solar  pressure 
torques  are  shown  in  Figure  8b. 

For  the  wrap-rib  antenna,  due  to  the  structurally  imbalanced  configuration 
a -16°  tilt  of  the  principal  axis  results.  This  pronounced  non-principal  axis 
pointing  has  created  a non-cyclic  x-axis  gravity  gradient  and  gyroscopic  torque 
of  2.06x10-2  N-m.  The  solar  pressure  torques  are  cyclic.  Figure  9 shows 
these  torques  over  a 24-hour  period. 


MIDNIGHT 
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“Co;S^| 


P = 9.6073  X 10"®  Ib/ff^ 
(4.6  X lO"^  N/M^) 


d,  = SUN  INCLINATION 
' ANGLE 


SUN  RADIATION 
VECTOR 


Figure  8a.  Orbital  Configuration  (a  = tilt  angle,  8 = orbit  angle) 
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Figure  8b.  Solar  Pressure  Disturbance  Torques  for  Hoop  and  Column  Antenna 
(a  = 6°  and  0^  = 0°)  . 


Figure  9.  Solar  Pressure  Disturbance  Torques  for  Wrap-Rib  Antenna 

/\  G 


D.  Control  Design 


Consider  the  steady  state  stochastic  linear  optimal  output  feedback 
regulator  problem.  The  following  formulation  applies  to  both  the  hoop  and 
column  and  the  wrap-rib  configurations;  the  only  differences  are  in  the  assign- 
ment of  variables  and  coefficient  matrices.  Let  x and  x be  the  state  vector 
and  the  state  estimate  and  let  z be  the  measurement  vector.  Let  F,  G,  and 
H be  the  assumed  state  matrix,  control  influence  matrix,  disturbance  distribu- 
tion matrix,  and  the  measurement  distribution  matrix,  respectively.  The  state 


and  the  state  estimator  equations  are: 

X = Fx  + Gu  + Tw  (7a) 

z = Hx  + V (7b) 

u = -K  X (7c) 

c 

• 

X = Fx  + Gu  + K (z  - Hx)  (7d) 

e 

The  cost  indeix  to  be  minimized  is 


J = f E{ 


(x'^Ax  + u'^Bu)  dt} 


(8) 


The  optimal  control  gains  and  the  filter  gains  are,  respectively 


-1  T 

K = B G S 
c 

T -1 

K = PH  R 
e 


(9a) 

(9b) 


where  S and  P satisfy  the  following  algebraic  equations,  respectively 


T 

-1  T 

+ A = 0 

SF  -t-  F S 

- SGB 

G S 

T 

T -1 

T 

FP  + PF 

- PH  R 

HP 

4-  PQ  r 

(10a) 
(10  b) 


where  A and  B are  the  state  and  the  control  weighting  matrices  and  u is  the 
control  vector;  and  w and  v are  the  plant  disturbances  and  the  measurement 
noise  vector,  respectively.  Q and  R are  the  power  spectral  density  matrices 
for  the  plant  disturbances  and  the  measurement  noises,  respectively . The  F 
matrix  is  block  diagonal  and  can  be  constructed  from  the  equations  of  motion; 
the  G,  H,  and  T matrices  can  be  obtained  from  the  eigenvector  data. 


For  the  hoop  and  column  antenna,  consider  the  three  rigid  body  modes  and 
the  first  seven  elastic  modes.  Examination  of  the  eigenvectors  of  the  elastic 
modes  reveals  that  for  small  deformations  and  attitude  excursion,  the  inter- 
actions between  the  pitch  axis  and  the  roll  and  yaw  axes  are  negligible. 
Therefore,  the  pitch  controller  can  be  treated  separately  from  the  roll  and  yaw 
controller.  Let  p be  the  subscript  for  the  pitch  controller,  and  y the 

roll  and  yaw  controller,  then  the  state  vectors,  the  measurement  vectors, 
etc.,  for  the  hoop  and  column  controllers  are  assigned  as  follows: 
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(11a) 


(9,0;  qg.qg; 


Z = 


«pb-®pb>  %fV>’ 


w - 


(T  T »F  F )■ 
dyb’  dyf  * dxb’  dxf'’ 


V = (v  , ,v  , tv  - ,v  - ) ' 
p yab*  yrb’  yaf ’ yrf 


and 


u = 


(llb) 

(llc) 

(lid) 

(lie) 


X = 

Y 

(<l>,'J;  '^ii’^ii’  '^I2’'^i2^ 

(12a) 

z = 
Y 

^‘^yb’^yb’  \b’\b’  \f’'^yf’ 

(12b) 

w = 
Y 

T 

(T  T -T  T «F  F ) 

dxb’  dxf  ’ dzb’  dzf  ’ yb’  yf"^ 

(12c) 

V = 

Y 

T 

^^xab’^xrb’  ^zab’^zrb’  '^xaf ’^xrf  ’ ^zaf’^^zrf^ 

(12d) 

u = 
Y 

T 

(T  T »T  T ) 

^ xb’  xf’  zb’  zf' 

(12e) 

where  in  Eqs.  (11)  and  (12),  f refers  to  feed  base,  b refers  to  spacecraft  bus, 
a and  r refer  to  attitude  and  rate,  respectively. 


Hardware  sizing  allows  for  assessment  of  onboard  disturbances.  Using 
the  subscripts  SK  for  station  keeping  and  WD  for  wheel  dumping,  the  estimated 
power  spectral  density  (PSD)  matrices  for  the  random  disturbances  are 


%SK 

= Diag  (.02,  . 02  ;4. 82x10"^, 9. 68x10"'^) 

(13a) 

^pWD 

= Diag  (.02,  .02) 

(13b) 

^ySK 

= Diag  (.02,. 02 ’.02,. 02; 4. 82x10“^, 9. 68x10"^) 

(14a) 

O' 

= Diag  (.02, .02-.02, .02;9. 68x10“^, 9. 68x10“^) 

(14b) 

The  PSD 

for  the  measurement  noises  are  assumed  as  follows: 

and 

R = 
P 

-10  -9  -1 0 -9 

Diag  (3.2x10  ,3.06x10  "^;3.2xl0  ,3.06x10  ) 

(15) 

II 

-in  -9  -10  -9 

Diag  (3.2x10  ,3.06x10  ;...;3.2xl0  ^,3,06x10  ) 

(16) 
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The  selection  of  the  state  and  control  weighting  matrices  determines  the 
control  design.  The  state  weighting  matrix  must  reflect  the  control  objectives 
such  as  requirements  on  stability ^ surface  and  pointing  errors,  etc.  Consider 
the  pitch  control.  Let  Ap  = Diag  (al,a2 ;a3 ,a4 ;a5 ,a6) . The  odd  elements,  in 
general,  contribute  to  the  stiffness  of  the  closed -loop  state  and  the  even  terms 
affect  the  damping. 


The  a2n+l  terms  are  determined  by  the  weighted  normalized  mean  square 
structural  attitude  angles,  feed  displacements,  and  dish  deformations;  or  more 
specifically,  they  may  be  expressed  as  follows: 


g 


0 


1 ? 


(17a) 


^F 


2 I «9xl'  + Si'  Sil') 


? + ‘*’9y/  ‘‘’9z/^ 

"d 


(17b) 


8f 


5 .c  ^2  ^ 13xi  ■ -^ISyi  ■ '^13zi 


-h 


? (^13xl'  + ^13yi'  + ^13zl'> 

"d 


(17c) 


where  the  subscripts  F and  D refer  to  feed  and  dish,  respectively;  nj)  is  the 
number  of  grid  points  on  the  dish,  ip  means  grid  point  i on  the  dish,  <j>9xi  is 
the  translational  mode  shape  for  mode  9 at  grid  i;  Sp  is  the  specified  rms 
surface  displacement.  Other  parameters  are  similarly  defined.  The  weighting 
factors  g0,  gp,  and  gp  reflect  the  relative  importance  of  the  individual  require 
ments.  The  values  for  the  weighting  factors  were  chosen  to  be  10,  20,  5, 
respectively.  This  selection  was  guided  by  the  feed  and  reflector  optical 
properties.  The  construction  of  the  roll  and  yaw  state  weighting  matrix  is 
similar , 


For  the  wrap-rib  antenna  control  design,  the  state  vector  contains  24 
elements  representing  the  12  modal  amplitudes  and  the  12  modal  rates  , In 
the  first  half  of  the  state  vector,  the  first  three  states  are  the  antenna 
attitude  (roll,  pitch,  yaw),  the  next  three  are  associated  with  boom  distortions 
and  the  last  six  are  dish  vibration  modes. 
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Similar  to  the  hoop-column  antenna  control  designs,  the  development  of 
control  and  estimator  gains  requires  good  engineering  judgment  in  selecting 
weighting  matrices  A,  B,  Q,  and  R.  Here  A is  a 24  x 24  matrix  of  the  form 

A = Block  diag  (A^ , Ag,  A^ , A^ , Ag,  A^)  (18) 

where 

Aq  is  3 X 3 weighting  for  attitude 
A^  is  3 X 3 weighting  for  boom  distortion 

D 

Ap  is  6 X 6 weighting  for  dish  distortion 

A^  , Ag,  and  A^^  are  corresponding  rate  weightings. 

For  all  wrap-rib  antenna  control  designs,  matrix  A stays  fixed  through- 
out, elements  of  which  are  chosen  as  discussed  in  the  following  paragraphs. 

Aq  is  chosen  with  weighting  matrix  B such  that  the  closed-loop  rigid 

body  modes  have  equivalent  frequencies  of  about  1/5  of  the  lowest  open -loop 

vibration  frequency  of  the  system  (which  is  0.55  rad/sec).  To  achieve  this, 

the  following  relation  has  been  used  to  define  A. 

y 

aJ  = I^BIU  b ^ (19) 

t)  w 

where 

I represents  the  3x3  inertia  tensor  of  system 

B is  the  3x3  control  weighting  matrix 

U is  the  3x3  identity  matrix 

b is  the  desired  bandwidth 
w 

Ag  is  chosen  as  a 3 x 3 diagonal  matrix,  with  each  element  equal  to  the 
inverse  of  the  required  boom  distortion  error  squared.  For  this  study,  boom 
distortion  is  given  in  terms  of  angles  at  the  boom  joint  and  the  requirement 
is  set  at  Q.Q1°  for  each  axis.  Therefore, 

Ag  = Diag  (3.27x1o7,  3.27x1Q^,  3.27x1Q^)  (20) 


Ag  is  the  weighting  for  dish  modes  and  is  chosen  to  minimize  dish 


RMS 


surface  errors.  The  following  relation  is  used  to  determine  A. 


D' 


■^7' 

i 

'^7  ' 

^8 

^8 

^12 

X12 

2 

[dish  RMS  surface  error  in  meters] 

[dish  RMS  surface  required  accuracy  in  meters] 
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-3  2 

the  dcinominator  is  set  at  (12x10  ) for  the  12  nun  dish  EMS  surface  accuracy 

requirement.  The  dish  RMS  surface  error  is  given  by 


_1 

6624 


1/2 


(22) 


where  is  the  6 selected  mode  shapes  of  the  dish  and  thus  its  dimension 

is  6624  X 6,  with  6624  being  the  number  of  nodes  of  the  dish. 


From  Eq . (21)  and  Eq . (22),  is  given  by 


6624 


(12x10  ) 


T 

. $ $ 

3^2  D D 


(23) 


The  values  for  Aq , Ag,  and  Ag  have  been  set  to  zero.  Matrix  B is  diagonal 
and  each  element  is  the  inverse  of  the  square  of  the  maximum  control  torque 
specified  for  each  axis. 

After  choosing  A and  B,  weighting  matrices  Q and  R were  selected 
to  obtain  desired  closed -loop  estimator  poles  for  the  rigid  body  modes.  The 
relation  used,  was 

T T 4 

Q = I RIU  b (24) 

Due  to  the  use  of  the  above  relation  in  designing  the  estimator,  the  power 
spectral  density  matrix  Q assumes  different  values  in  the  design  than  in  the 
perfomance  evaluation.  For  system  performance  evaluation,  the  actual  process 
disturbance  PSD  was  used.  However,  sensor  measurement  noise  PSD  Is  Identical 
for  design  and  evaluation  processes.  In  particular,  the  Q matrix  for  perfor- 
mance evaluation  of  systems  1 and  2 is  given  by 

Q = Diag  (9. 8x10"^, 7 .688x10"^, 3. 528x10"^)  (25) 

and  for  System  3 by 

Q = Diag  (9. 8x10“^, 7. 688x10"^, 3. 528x10'^, 9. 8x10“^, 7. 68x10"^, 3. 52x10"^) 

(26) 

The  R matrix  for  design  and  evaluation  is  given  by 

R = Diag  (3*1.52x10“^^,  3*1.88x10"^^)  (27) 

for  3 angular  and  3 rate  measurement  noises . If  an  optical  sensor  is  used 
for  dish  measurement  of  15  points,  R has  15  additional  terms,  i.e,, 

R = Diag  (3*1. 52x10“^^, 3*1. 88x10"^^, 15*8. 0x10"®)  (28) 
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E.  Model  Parameter  Error  Problem  and  Performance  Evaluation  Model 


In  Section  HID  the  control  system  has  been  defined  with  the  assumption 
that  the  structural  parameters  were  known.  However,  in  reality  large  space 
system  parameters  can  not  be  accurately  predicted  on  the  ground.  The  result  of 
this  model  error  could  seriously  destabilize  the  system.  Let  F'  be  the 
actual  state  matrix  and  F be  the  modeled  state  matrix.  Let  e be  the  estimator 
error,  x-x.  The  closed -loop  system  may  be  represented  as 
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’f'-GK 
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-GK 
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F-F' 

F-K  H. 
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(29) 


In  the  absence  of  model  errors,  i.e.,  F '=F , the  eigenvalues  of  the  closed-loop 
system  are  those  determined  by  det  (F-GK^)  and  det  (F-K^H) . For  this  type  of 
controller,  the  system  is  asymptotically  stable.  However,  when  F'  ^ F,  stability 
can  not  be  guaranteed  and  destabilization  can  occur  if  F'  is  sufficiently 
different  from  F as  demonstrated  in  the  next  section.  Note  in  Eq . (29) 
the  gain  matrices  are  designed  based  on  the  modeled  system  parameters. 


The  control  system  performances  that  are  of  particular  interest  are  (1)  the 
attitude  and  surface  accuracy  achievable  in  the  projected  disturbance  environ- 
ment, (2)  the  relative  stability,  and  (3)  the  robustness  of  the  controller  in 
the  presence  of  model  parameter  errors.  These  performance  measures  may  be 
evaluated  through  computer  simulations  of  the  transient  responses  or  the  steady 
state  analysis  of  the  statistical  averages.  The  relative  stability  can  be 
obtained  from  the  closed-loop  eigenvalues.  The  parameter  error  sensitivity 
problem  can  be  evaluated  using  Eq . (29)  and  the  following  formulation.  In  the 
remainder  of  this  subsection  we  shall  concentrate  on  the  development  of  the 
evaluation  models  for  the  attitude  errors,  surface  deformations,  etc.  using  the 
state  covariance  matrix,  X^. 

Let  Xqp  and  X^y  be  the  augmented  state  covariance  matrices  for  pitch  and 
roll  and  yaw  control  and  estimation,  respectively.  The  values  for  X^p  and  XQy 
may  be  obtained  by  solving  the  following  matrix  algebraic  equations: 
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where  F^p , F^y,  and  fj^y  are  defined  in  Eq.  (29)  and  Q^p  and  Q^y  are  the 

power  spectral  densities  of  WQp  and  w^y,  respectively,  also  defined  in  Eq.  (29). 
The  rms  attitude  error  cf^  for  the  structure,  the  rms  dish  deformation  Cp , and 
the  rms  feed  displacement  Op  may  be  defined  in  terms  of  the  covariance  matrices 
as  follows: 
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where  X(|),  X0 , X,|,,  X and  X^y  are  subraatrices  of  Xp  and  Xyj  and  ‘(’pp,  ‘J’Dy » “^Fp » 
and  ‘(’pY  are  submatrices  of  the  eigenvector  matrix  For  instance,  Xqp  and 
top  are 
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360x2 

and  in  Eqs . (33)  and  (34)  tr  is  the  trace  of  the  matrix  in  the  argument. 

The  rms  control  effort  is  also  of  interest  and  can  be  obtained  from  the 
control  covariance  matrices  as  follows; 


U = K X K 
P pc  p pc 


(37a) 


U = K X K 
y yc  y yc 

U = [tr (U  + U ) ] 
rms  p y 


1/2 


(37c) 


The  rms  attitude  error  in  Eq.  (32)  is  the  attitude  of  the  entire  structure  or 
the  rigid  body  attitude.  It  is,  therefore,  not  the  beam  pointing  attitude. 

The  actual  beam  pointing  error  is  a function  of  the  relative  motions  of  the 
feed,  the  dish,  and  the  dish  surface  deformation  in  addition  to  the  structural 
attitude  defined  in  Eq.  (32). 

The  attitude  errors  at  the  feed  and  at  the  bus  can  be  easily  defined, 
however,  in  terms  of  the  variables  discussed  in  this  section. 

IV.  DISCUSSION  OF  RESULTS 
A.  Hoop  and  Column  Control  Performance 

Designs  discussed  in  the  preceding  subsections  have  yielded  reasonable 
performance.  Figure  10  shows  the  feed  and  the  bus  pointing  errors  with  parameter 


241 


error  varying  from  70%  to  130%.  The  controller  becomes  unstable  if  parameter 
errors  exceed  these  boundaries.  Figure  11  shows  a similar  result  for  the  feed 
displacement.  In  all  cases  when  the  actual  frequencies  increased,  the  perfor- 
mance has  improved  somewhat.  This  is  because  when  frequency  increases,  the 
structure  becomes  stiffer. 

One  important  question  that  needs  to  be  addressed  here  is  how  much  a two- 
site  control  is  better  than  a one-site  control  system.  Figure  12  demonstrates 
that  the  two-site  control  system  is  more  robust  than  either  of  the  two  one-site 
control  systems. 


Figure  10,  RMS  Pointing  Accuracy  for  Hoop  and  Column  Antenna. 


Figure  11.  RMS  Feed  Displacement  for  Hoop  and  Column  Antenna. 
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Figure  12.  Stability  Regions  for  Three  Control  Systems  for  Hoop  and  Column 
Ant  enna 


Further  comparisons  for  the  three  control  concepts  are  in  Figure  13  where 
the  identifications  of  critical  modal  parameters  are  made.  The  bending  modes 
are  less  critical  to  all  but  the  single-site  feed  control  system.  The  torsional 
modes  are  critical  to  all  the  systems  and  they  set  the  stability  limits  for  both 
the  single-site  bus  control  and  the  two-site  control  systems. 
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Figure  13.  Stability  Regions  Subject  to  Parameter  Errors  for  Hoop  and  Column 
Antenna 


243 


B.  Wrap-Rib  Control  Performance 


Consider  the  first  case  where  actual  boom  frequencies  are  the  same  as  the 
boom  frequencies  used  in  the  control  designs.  Figure  14  shows  the  dish  LOS 
(Line-of -Sight)  stability  error  as  a result  of  having  1 newton-meter  sinusoidal 
disturbance  torque  applied  to  the  antenna.  For  example,  in  cases  where  the 
disturbing  sinusoidal  torque  has  the  same  frequency  as  the  first  vibration 
frequency  of  the  antenna  at  0.55  rad/sec,  the  dish  LOS  error  will  be  0.03°  for 
control  system  1,  0.015°  for  control  system  2,  and  0.002°  for  control  system  3. 
This  means  that  having  capabilities  of  optical  sensing  and  extra  control  at 
dish  hub,  system  3 is  able  to  bring  peak  errors  down  by  an  order  of  magnitude 
and  distribute  the  errors  more  uniformly. 

In  addition,  system  3 provides  more  stable  and  robust  performance  than 
the  other  two  systems  as  actual  boom  frequencies  decrease.  This  is  illustrated 
in  Figures  15  and  16.  As  actual  boom  frequencies  decrease  to  62.5%  of  the 
design  frequencies,  the  peak  LOS  errors  for  system  3 is  about  2.5  times  better 
than  that  of  system  2,  whereas  system  1 is  already  unstable.  Similarly,  in 
the  last  case  where  actual  boom  frequencies  are  60%  of  the  design  boom  frequen- 
cies, the  peak  LOS  error  for  system  3 is  at  about  0.02°,  and  both  system  1 and 
system  2 are  unstable. 


Figure  14.  Wrap-Rib  Dish  Llne-of -Sight  Error  with  Exact  Boom  Frequency  in 
Model . 
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Figure  17  shows  the  peak  dish  LOS  error  for  the  three  systems  as  functions 
of  boom  frequency  errors  which  has  further  illustrated  the  superior  performance 
of  System  3 than  those  of  Systems  1 and  2. 


ASSUMED  BOOM  FREQ 

Figure  17.  Wrap -Rib  Peak  Dish  Line-of -Sight  Error  vs.  Boom  Frequency  Error. 


V.  CONCLUSIONS 

1.  Structural  uncertainties  and  model  error  can  cause  serious  performance 
deterioration  and  can  even  destabilize  the  controllers. 

2.  For  the  hoop  and  column  antenna,  large  hoop  and  long  mast  and  the  lack 
of  stiffness  between  the  two  substructures  result  in  low  structural  frequencies. 
Performance  can  be  improved  if  this  design  can  be  strengthened. 

3.  The  two-site  control  system  is  more  robust  than  either  single-site 
control  systems  for  the  hoop  and  column  antenna.  The  two-site  control  concept 
has  resulted  in  reasonable  hardware  requirements  when  applied  to  LMSS  communi- 
cation missions. 

4.  For  the  wrap-rib  antenna,  the  uncertainties  in  boom  dynamics  and  its 
stiffness  are  critical  to  the  control  design. 

5.  By  adding  hub  sensor  and  hub  actuators,  System  3 has  out  performed 
Systems  1 and  2 by  a great  margin  in  the  wrap-rib  configuration. 

6.  Flight  tests  and  in-orbit  system  identification  of  critical  structural 
modes  will  insure  performance  and  reduce  risk  for  large  space  antenna  missions. 
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SESSION  III:  DISCUSSION  - LSS 
CONTROL  PROBLEMS 


Moderator:  R.R.  Strunce,  Charles  Stark  Draper  Laboratory,  Inc. 

Panel  Mereibers:  N.K.  Gupta,  Integrated  Systems,  Inc.,  J.  Sesak,  General  Dynamics, 
and  A.F.  Tolivar,  Jet  Propulsion  Laboratory 


SYNOPSIS* 

Each  panel  member  gave  an  opening  statement,  then  a general  discussion  with 
the  audience  followed. 

N.  K.  GUPTA:  The  problem  may  be  divided  into  three  areas  for  discussion: 

analytical  developments,  real-time  processing,  and  actuators  and  sensors. 

In  analytical  developments,  insufficient  emphasis  has  been  placed  on 
numerical  techniques.  Adequate  numerical  processing  techniques  do  not  exist 
to  deal  with  the  high-order  models  used  in  control  of  large  space  structures. 

Real-time  processing  will  be  needed  at  very  high  computation  rates,  perhaps 
10  million  to  100  million  operations  per  second.  The  DoD  Very  High  Speed 
Integrated  Circuits  (VHSIC)  program  may  provide  a partial  solution.  It  may  be 
necessary  to  look  into  other  techniques  involving  both  analytical  and  real-time 
processor  developments. 

Reliability  will  be  extremely  important  in  these  very  complex  systems.  The 
problem  of  developing  high-order  languages,  etc.,  that  will  allow  convenient 
and  reliable  real-time  processing  and  verification  of  control  laws  must  be 
looked  into. 

Actuators  and  sensors  that  have  to  operate  in  the  space  environment  for 
several  years  are  very  important.  Basic  actuator  and  sensor  limitations  should 
be  taken  into  account  in  analytical  developments.  For  example,  some  robustness 
proofs  are  not  valid  when  actuator  and  sensor  dynamics  are  considered. 

It  is  a very  big  jump  going  from  analytical  studies  to  full  space  flight 
experiments.  It  seems  that  intermediate  steps  of  simulation  and  laboratory 
experiments  vrould  be  very  useful  and  cost-effective  before  going  into  flight 
experiments . 

J.  SESAK:  My  discussion  focuses  on  experiments  for  large  space  structures 

technology.  Most  laboratory  experiments  conducted  to  date  have  been  with  very 
simple  examples,  such  as  beams  and  flat  plates.  Nothing  tested  resembles,  even 


*Thls  synopsis  attempts  to  capture  the  main  points  discussed  but  has  not  been 
reviewed  or  endorsed  by  the  speakers. 
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remotely,  an  actual  space  structure.  More  complex  structures  and  tests  that 
closely  represent  actual  space  structures  should  be  used  in  experiments. 

Flight  experiments  will  be  necessary  to  achieve  realistic  data,  particularly 
with  respect  to  damping,  in  the  space  environment. 

There  is  the  need  for  actuator  technology  for  operation  at  very  low 
frequencies  (fractions  of  Hz).  Flight  tests  are  needed  not  only  for  verifying 
theories  but  also  for  actuator  and  sensor  hardware, 

A.  F.  TOLIVAR:  The  problem  with  large  space  structures  is  not  just  one  of  high 

frequencies,  but  of  extending  the  actuator  and  sensor  hardware  capability  down 
into  the  very  low  frequency  ranges,  essentially  to  quasi-static  conditions. 

In  several  technical  areas,  the  theory  that  has  been  developed  to  date  needs 
to  be  made  ready  for  flight.  The  theory  should  be  developed  to  the  point  where 
efficient  packages  exist  that  can  operate  on-board  the  spacecraft  with  confi- 
dence. For  example,  if  one  were  asked  for  an  autonomous  system  identification 
program  for  a complex  100-meter  space  system,  it  would  not  be  possible  to 
accomplish  that  in  a cost-effective  manner  with  today's  level  of  technology. 

How  do  we  convince  a project  manager  that  the  technology  is  indeed  ready 
for  mission  applications?  It  is  through  demonstrating  the  technology  both  with 
laboratory  and  flight  experiments  on  realistic  articles, 

R.  R.  STRUNCE:  The  fundamental  issue  is  how  to  design  a finite  compensator  to 

control  infinite-dimensional  structures.  There  are  four  major  technology  areas 
to  pursue:  simultaneous  achievement  of  performance  and  robustness;  system 

identification;  reduced-order  compensators;  and  sensor/actuator  technology.  An 
important  question  for  this  workshop  is:  do  we  have  a solution  looking  for  a 

problem  or  do  we  have  a real  problem  that  needs  solution?  Maybe  it  is  time  to 
move  into  a technology  demonstration  activity. 

Comments  and  Questions  from  the  Audience 

A.  N.  MADIWALE,  MIT  Lincoln  Lab:  Major  problems  in  control  of  large  structures 

are  the  parameter  uncertainties  or  inaccuracies  in  the  large  scale  models.  The 
uncertainties  could  be  from  20%  to  100%.  A workshop  paper  by  Hyland  and 
Madlwale  presents  a technique  that  directly  incorporates  the  parameter  uncertain- 
ties into  the  modeling  process  and  designs  an  optimum  controller  around  these 
uncertainties.  Certain  robustness  properties  are  obtained  by  this  process. 

V.  KOMKOV,  West  Virginia  University:  There  certainly  are  problems  with  modeling 

large  structures  with  very  slender  members.  Modeling  that  represents  the 
physical  reality  has  not  been  developed  very  well.  The  assumptions  used  in 
modeling,  for  example  in  the  Euler-Bernoulli  theory,  are  not  looked  at  very 
carefully  and  often  do  not  apply.  One  might  throw  in  all  kinds  of  uncertainties 
but  it  really  does  not  solve  the  problem  of  having  a reasonable  theory. 

L.  WEISSTEIN,  Lockheed  Missiles  and  Space  Co.  (LMSC) : There  are  available 

analytical  models  that  have  parameters  of  real  space  structures  far  beyond  the 
simple  problems  characterized  by  the  experiments  discussed  here.  Some  conti- 
nuum models  have  been  developed  for  such  structures  as  two-layer  tetrahedral 
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trusses,  hexagonal  trusses,  periodic  trusses,  and  framework  structures.  These 
would  certainly  aid  in  applying  control  theories  to  more  real-life  structures. 
The  manuscript  listed  below  presents  an  extensive  bibliography.* 

A.  M.  MEYSTEL,  University  of  Florida:  There  are  two  different  areas  which 

apply  to  large  space  structures:  control  in  the  small  and  control  in  the  large. 

The  only  topics  covered  here  have  been  linked  with  small  motions.  Large,  quick 
motions  should  also  be  considered  for  which  neither  linear  quadratic  gaussian 
(LQG)  nor  other  approaches  discussed  here  apply.  Research  should  concentrate  on 
other  approaches  than  LQG  that  can  treat  "control  in  the  large." 

We  should  separate  in  our  discussions  "theory  for  design"  from  "theory  for 
control."  They  are  two  different  topics  and  should  be  treated  differently.  The 
model  used  for  preliminary  system  design  may  be  very  different  from  the  one  used 
in  real-time,  on-line  control. 

There  are  many  methods  for  decoupling  control,  but  special  techniques  must 
be  developed  for  on-line  decoupling  for  large  scale  systems. 

The  issues  of  actuator  resolution  and  dynamics  in  the  zero-g  environment 
should  be  given  more  attention. 

Probably  the  most  important  step  that  should  be  taken  is  for  NASA  to  define 
future  applications  and  specifications  for  control  of  large  space  structures  so 
that  the  control  community  can  focus  on  the  same  problems. 

A.  F.  TOLIVAR:  In  response  to  Professor  Leonard  Melrovitch's  question  about  the 

state  of  the  art  of  distributed  actuation  and  sensing,  I have  the  following 
comments. 

The  problem  of  including  actuator  dynamics  in  the  model  of  a large  space 
structure  has  not  been  addressed  to  any  great  extent.  Instrumentation  hardware 
and  associated  measurement  algorithms  that  satisfy  the  performance  requirements 
that  are  space-qualified  are  not  available.  Similarly,  for  the  actuation  area, 
there  does  not  exist  a technology  for  a space-qualified  actuator  that  operates 
at  very  low  frequencies,  perhaps  1/lOOth  of  1 Hz.  Hundreds  of  sensors  and 
actuators  may  be  needed  for  a single  space  system  and  there  is  basically  no 
space-qualified  hardware  available  in  this  area. 

R.  R.  STRUNCE:  Fault  tolerance  becomes  a major  issue  when  considering  systems 

with  hundreds  of  sensors  and  actuators.  How  will  the  wires  (or  fiber  optics)  be 
run?  How  will  failures  be  accommodated?  How  will  the  control  laws  perform  with 
internal  failures?  It  seems  that  this  major  area  is  being  overlooked. 

B.  R.  HANKS,  Langley;  It  is  surprising  to  find  that  in  this  workshop,  as  in 
most  others  on  control  of  large  flexible  structures,  there  are  so  few  structural 
dynamicists.  Keep  in  mind  that  stiffening  up  the  structure  may  be  the  best 


*L.  S.  Weisstein,  Continuum  Models  for  Repetitive  Beamlike  Structures, 
unpublished  master's  thesis,  George  Washington  University,  1982. 
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solution  in  many  cases.  If  actuators  required  for  rigid-body  control  are  not 
adequate  for  flexible-mode  control,  then  maybe  they  are  being  designed  the  wrong 
way.  Maybe  passive  stiffening  and  damping  are  better. 

R.  S.  GRAN,  Grumman  Aerospace:  The  structural  dynamics  people  first  of  all  find 

our  problem  boring  because  we  all  talk  about  linear  systems  and  they  stopped 
talking  about  them  20  years  ago.  The  structural  dynamics  research  problems  nov7 
are  very  complicated  and  involve  nonlinear  partial  differential  equations  and  so 
on.  Control  engineers  like  linear  models  and  those  are  boring.  The  issue  of 
doing  control  has  to  be  divided  into  two  parts:  Open-loop  control  of  the  non- 

linear system  to  give  a nominal  trajectory;  then,  closed-loop  linear  control  of 
perturbations  about  the  nominal  trajectory.  If  the  control  law  does  a good  job, 
then  the  motion  will  always  consist  of  small  perturbations,  and  the  linear  model 
will  be  valid. 

B.  R.  Hanks  has  a very  good  point.  A piece  of  wire  is  a lot  cheaper  than  a 
control  system. 

H.  A.  REDIESS,  Milco:  There  is  a clear  need  to  perform  large-scale  flight 

experiments  in  space,  but  has  everything  been  done  in  laboratory  experiments  that 
is  possible  to  develop  and  verify  techniques? 

B.  R.  HANKS,  Langley:  You  can  not  practically  do  laboratory  experiments  of  large 

flexible  structures  that  have  modes  on  the  order  at  0.1  IIz  or  less  because  the 
suspension  systems  become  an  inherent  part  of  the  structure.  Also,  damping  in  a 
structure  is  very  dependent  on  the  frequency,  so  scaling  of  stiffen  structural 
models  down  to  the  very  low  frequencies  will  not  work.  Laboratory  experiments 
cannot  solve  the  total  problem. 
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NUMBER  AND  PLACEMENT  OE  CONTROL  SYSTEM 
SENSORS  CONSIDERING  POSSIBLE  FAILURES 

Wallace  E.  Vander  Velde*  and  Craig  R.  Carignan** 

Department  of  Aeronautics  and  Astronautics 
Massachusetts  Institute  of  Technology 
Cambridge,  MA  02139 


ABSTRACT 


In  the  future  design  of  active  control  systems  for  large,  flexible  space 
structures  there  is  likely  to  be  some  freedom  as  to  the  placement  of  the  control 
system  sensors  and  actuators.  Certainly  the  designer  will  have  some  choice  over 
the  number  pf  these  components  which  he  specifies.  The  answer  to  the  question 
of  hov7  many  control  system  components  should  be  used  and  where  they  should  be 
placed  on  the  structure  is  not  likely  to  be  obvious.  The  matter  becomes  even 
more  obscure  when  one  considers  the  likelihood  that  some  of  these  components 
will  fail  during  the  operating  lifetime  of  the  system. 

This  paper  presents  a methodology  which  is  intended  to  assist  the  designer 
in  this  process  with  respect  to  sensors.  The  suggested  approach  begins  with 
definition  of  a Degree  of  Observability  of  the  system  for  a given  set  (number 
and  location)  of  sensors.  This  measure  of  observability  is  a quantitative  indi- 
cator of  how  well  the  system  can  be  observed  with  a given  set  of  sensors.  The 
issue  of  component  unreliability  is  then  introduced  by  computing  an  average 
Degree  of  Observability  over  the  operating  lifetime  of  the  system  accounting 
for  the  likelihood  of  various  combinations  of  component  failures.  This  measure 
reflects  the  basic  capability  of  the  sensor  set  to  observe  the  system  state  in 
the  context  of  the  failures  which  will  probably  occur.  Having  defined  this 
average  measure  of  the  performance  capability  of  the  set  of  sensors,  one  can 
optimize  the  locations  of  a given  number  of  components  by  maximizing  the 
measure,  and  one  can  see  how  this  optimized  measure  varies  with  the  number  of 
components.  This  provides  the  designer  with  a meaningful  basis  for  choice  of 
number  and  location  of  control  system  sensors. 


INTRODUCTION 


The  dimensions  of  space  structures  being  considered  for  future  applications 
are  on  the  order  of  several  hundred  meters  to  several  kilometers  and  will  re- 
quire a large  number  of  actuators  and  sensors  for  attitude  and  shape  control.  A 
solar  power  satellite,  for  Instance,  may  require  hundreds  of  control  moment 
gyros  and  thrusters  to  damp  out  surface  vibrations  caused  by  periodic  disturb- 
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ances  such  as  solar  and  gravity  gradient  torques.  One  of  the  first  decisions  a 
designer  of  the  control  system  for  such  an  assembly  must  make  is  how  many  sen- 
sors and  actuators  to  use  and  where  to  place  them  on  the  structure.  Placement 
represents  a substantial  degree  of  freedom  available  to  the  designer  and  is 
usually  not  a very  straightforward  question.  It  is  even  less  apparent  when  one 
considers  redundancy  in  the  system  to  allow  for  failures;  even  if  the  "optimal" 
position  of  -a  sensor  is  known,  it  may  not  be  so  clear  where  a backup  sensor 
should  be  placed.  The  number  of  components,  to  be  used  must  reflect  the  trade- 
off of  cost,  weight,  power,  etc.  vs.  system  performance— and  the  evaluation  of 
performance  should  recognize  the  likelihood  of  some  component  failures  during 
the  lifetime  of  the  system. 

The  effect  of  component  failures  is  emphasized  here  because  even  an  elemen- 
tary analysis  demonstrates  that  component  unreliability  will  have  a dominating 
influence  on  the  behavior  of  these  systems.  The  control  system  must  be  con- 
figured with  the  capability  to  accommodate  component  failures,  and  the  likely 
effect  of  these  failures  on  the  performa;nce  of  the  system  must  be  anticipated 
from  the  beginning  of  the  design  process.  As  one  example  of  this,  suppose  a 
system  is  to  operate  for  three  years  between  visits  for  maintenance  and  re- 
supply. If  the  control  system  utilizes  a total  of  400  sensors  and  actuators — 
each  with  an  exponential  distribution  o-f  time  to  failure  with  a mean  time  to 
failure  of  100,000  hours  (optimistic  by  today's  standards) — the  probability  that 
none  of  these  components  will  fall  in  this  Interval  is  about  2 • 10~'^^,  which 
indicates  virtual  certainty  that  one  or  more  failures  will  occur,  In  fact,  many 
failures  are  likely  to  occur — ^about  one  every  10  days  on  average! 

In  this  work  we  develop  a methodology  for  measuring  the  performance  of  a 
system  which  reflects  the  type,  number  and  placement  of  the  sensors  on  the  struc- 
ture. The  measure  also  reflects  the  expected  loss  of  performance  due  to  compo- 
nent failures.  This  performance  measure  is  intended  to  be  especially  useful  as 
a guide  to  the  choice  of  sensor  number  and  placement  in  the  earliest  phase  of 
system  design. 

Problem  Definition 

It  would  be  most  helpful  to  the  control  engineer  to  have  some  criterion  at 
his  disposal  for  placing  sensors.  Unfortunately,  modern  control  theory  does  not 
provide  any  such  measure  of  "observability."  Observability  is  simply  a Unary 
concept — either  a system  is  observable  or  it  is  not.  It  does  not  say  how  ob- 
servable a system  is  with  a given  set  of  sensors. 

Some  work  has  appeared  in  recent  literature  which  addresses  this  question. 
Juang  and  Rodriquez  (Ref.  1)  formulate  the  linear  quadratic  regulator  problem 
with  an  infinite  time  horizon.  The  solution  defines  the  optimal  cost  as  an 
explicit  function  of  the  initial  state,  and  indirectly  as  a function  of  the 
number  and  location  of  the  actuators  and  sensors.  The  expectation  is  taken 
over  a defined  distribution  of  initial  conditions— producing  the  expected  cost 
as  a function  of  the  actuator  and  sensor  set  only.  While  this  approach  has  some 
appeal,  we  found  several  objections  to  the  method:  (1)  The  weighting  of  control 

effort  versus  state  excursions  in  the  cost  function  is  rather  arbitrary;  (2)  If 
there  is  a particular  direction  in  the  state  space  in  which  the  state  is  not 
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very  observable,  this  fact  is  largely  lost  when  the  cost  is  averaged  over  the 
distribution  of  initial  states;  (3)  The  expected  cost  is  an  inverse  measure  in 
which  larger  cost  means  poorer  observability  and  becomes  infinite  when  the  sys- 
tem is  unobservable;  and  (4)  The  expected  cost  depends  on  both  the  actuator  set 
and  the  sensor  set  whereas  it  may  be  more  useful  to  have  separate  measures  of 
observability  and  controllability. 

Skelton  and  Hughes  (Ref.  2)  define  measures  in  terms  of  controllability  and 
observability  "norms"  which  apply  to  the  individual  modes  of  a system  rather 
than  to  the  system  as  a whole.  Their  approach  is  also  tailored  to  "linear 
mechanical  systems"  which  have  a special  form  of  representation  as  a second 
order  matrix  differential  equation.  Although  that  form  applies  to  space  struc- 
ture dynamics,  we  prefer  to  define  measures  which  have  a physical  Interpretation 
in  terms  of  control  or  estimation  error  characteristics  for  general  linear  sys- 
tems . 

What  we  seek,  then,  is  a quantitative  measure  of  how  well  the  system  state 
can  be  observed  with  a given  set— —number  and  location — of  sensors.  This  measure 
should  have  as  clear  an  interpretation  as  possible,  be  easy  to  compute,  and 
should  not  depend  on  the  design  of  the  control  system.  Moreover,  it  should 
properly  reflect  the  loss  of  capability  due  to  the  sensor  failures  which  are 
likely  to  occur  over  the  mission  period. 


Dynamic  Measure  of  Observability 

Any  measure  of  the  observability  of  a dynamic  system  should  reflect  as  di- 
rectly as  possible  the  amount  of  information  which  can  be  derived  about  the  sys- 
tem states  from  the  sensor  outputs  in  a given  amount  of  time.  The  means  of  ob- 
taining this  information  is  by  attaching  to  the  system  an  observer  whose  states, 
k,  are  estimates  of  the  true  states  of  the  system.  The  more  information  that  is 
obtained  about  the  system,  the  smaller  the  estimation  error  becomes. 

In  the  parallel  problem  of  indicating  the  controllability  of  a system  with 
a given  set  of  actuators  it  is  not  obvious  how  the  control  should  be  used  so  as 
to  realize  the  best  possible  control  of  the  system.  What  is  "best"  might  be  de- 
fined in  a number  of  reasonable  ways.  But  in  this  case,  if  we  use  a linearized 
description  of  the  system  dynamics  from  the  outset,  then  the  best  way  to  process 
the  sensor  data  so  as  to  minimize  estimation  error  is  obvious.  The  linear  esti- 
mator which  minimizes  the  state  estimation  error  vector,  e = x - x,  in  a mean 
square  sense,  i.e.,  minimizes 


S = e"'^  Me  (1) 

where  M is  some  weighting  matrix,  is  the  Kalman  Filter. 

With  the  state  dynamics  and  measurement  relation  given  as 


X = Ax 

+ Bu  + w 

(2) 

y = Cx 

+ V 

(3.) 
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the  estimation  error  covariance  matrix  with  Kalman  filtering  obeys  the  equation 

P = AP  + Pa"*^  - PC^n'^CP  + Q (4) 

where  P is  the  estimation  error  covariance  matrix,  and  N and  Q are  the  measure- 
ment and  driving  noise  intensity  matrices,  respectively.  But  the  estimation 
error,  or  its  covariance  matrix,  is  an  inverse  indication  of  the  amount  of  in- 
formation one  has  about  the  state.  We  wish  the  measure  of  observability  to  be 
a direct  indicator  of  information,  so  to  that  end  we  introduce  the  information 
matrix — the  Inverse  of  the  error  covariance  matrix. 

J = P~^  (5) 

One  other  consideration  is  that  the  observability  measure  should  be  a 
property  of  the  system  and  a set  of  sensors— nothing  else.  Since  the  measure- 
ment noise  is  a property  of  the  set  of  sensors  being  evaluated,  we  retain  its 
inclusion  in  (4)  in  the  form  of  N but  do  not  include  the  effect  of  state  driving 
noise,  because  that  is  an  external  influence  not  related  to  the  sensor  set. 

Thus,  if  we  set  Q = 0,  then  (4)  in  terms  of  J becomes 

J = - JA  - a"^J  + c\"^C  (6) 

Take  as  the  standard  situation  the  case  in  which  there  is  no  Information  about 
the  state  initially  and  data  is  collected  up  to  a specified  time  T.  Then 
J (0)  = 0 and  one  is  interested  in  J (T) . 

Having  the  information  matrix  at  time  T,  we  need  some  scalar  measure  of  how 
large  the  matrix  is  as  an  indication  of  how  much  information  has  been  generated 
by  optimal  processing  of  the  sensor  data.  One  way  of  measuring  the  size  of 
J (T)  is  by  reference  to  the  quadratic  surface 

T -1 

V J V = 1 (7) 

Equation  (7)  defines  an  ellipsoidal  surface  in  v-space.  If  J is  a diagonal 
matrix  (one  can  always  transform  to  principal  coordinates),  one  observes  that 
increasing  an  element  will  expand  the  ellipsoid  in  the  direction  v^.  Thus 
the  larger  J becomes,  the  larger  the  volume  encompassed  by  the  surface  in  (7) 
so  that  the  more  information  obtained  about  the  system,  the  larger  the  volume 
becomes. 

Typically,  however,  some  components  of  x will  be  of  greater  concern  than 
others — especially  considering  that  different  units  will  apply  to  different 
components.  Regardless  of  how  one  were  to  approach  the  definition  of  a measure 
of  observability,  it  is  clear  that  a scaling  of  the  state  variables  to  reflect 
their  relative  importance  to  the  success  of  the  mission  would  be  required.  To 
that  end,  define  the  transformation 
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z = Fv 


(8) 


where  e,-  l5>  the  maximum  error  one  is  willing  to  tolerate  in  the  direction  x.. 

The  more  error  one  is  willing  to  tolerate  in  that  direction,  the  greater  the 
transformed  state  so  the  larger  the  volume  becomes.  Thus  the  scaling  is  con- 
sistent with  the  requirement  that  the  more  important  a variable  is 

(smaller  e.-  ) the  smaller  it  will  appear  in  the  transformed  space  and 

-^mcix ' 

the  less  observable  it  appears  unless  the  sensor  set  is  adjusted  to  favor  that 
variable.  Also  note  that  v has  units  of  reciprocal  error,  so  z is  dimensionless. 


Now  that  the  axes  have  been  scaled  so  that  it  is  equally  important  to  ob- 
tain information  in  each  direction,  one  can  interpret  the  size  of  the  surface 
generated  by  the  information  matrix  in  terms  of  the  volume  enclosed  by  that  sur- 
face. But  the  volume  alone  does  not  reflect  the  shape  of  the  surface.  An  ideal 
allocation  of  sensors  would  produce  a sphere  in  the  space  of  equally  important 
variables.  Thus  if  two  sensor  sets  produce  surfaces  enclosing  equal  volume,  we 
would  favor  the  set  which  produced  the  more  nearly  spherical  surface. 

After  considering  a number  of  alternatives,  the  degree  of  observability  was 
chosen  to  be  the  following: 

DO  - [vs  + ^ (V,  - V^) 

where  Vg  is  the  n-dimensional  volume  of  the  ellipsoid , in  the  transformed  space 
and  Vg  is  the  volume  of  the  largest  Inscribed  sphere;  n is  the  dimension  of  the 
state  space.  The  first  term  on  the  right  side  of  (9)  is  the  predominant  term  in 
the  observability  measure;  it  reflects  the  smallest  magnitude  of  reciprocal 
error  relative  to  the  maximum  tolerable  error  (largest  value  of  error  relative 
to  which  remains  after  optimal  processing  of  the  sensor  data  for  time  T 

starting  with  no  Initial  information.  If  the  sensors  were  ideally  allocated, 
the  surface  would  be  a sphere  and  Vg  would  be  the  observability  measure.  The 
second  term  in  (9)  adds  a smaller  amount  to  DO  to  recognize  the  additional  in- 
formation derived  about  the  other  variables  if  the  surface  is  not  spherical. 

The  additional  volume,  Vg  - Vg,  is  scaled  by  Vg/Vg  so  that  the  most  this  term 
can  add,  as  Vg  «>,  is  Vg  and  so  that  DO  is  zero  if  there  is  any  direction  in 
which  the  system  does  not  derive  information  at  all.  This  is  the  case  of 
traditional  unobservablllty , and  Vg  = 0.  The  nth  root  of  the  weighted  volume 
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is  taken  as  the  observability  measure  to  make  it  proportional  to  the  linear  di- 
mensions of  the  region  in  which  information  is  produced.  The  volume  weighting 
scheme  for  a two-dimensional  case  (volumes  are  areas)  is  depicted  in  Figures 
1 (a-c) . 

Once  one  accepts  (9)  as  a reasonable  assessment  of  the  observability  of  the 
system,  what  remains  to  be  shown  are  the  mechanics  of  computing  the  n-dimension- 
al  volumes  Vg  and  V£.  The  volume  enclosed  in  any  quadratic  surface  is  propor- 
tional to  the  product  of  the  reciprocals  of  the  square  roots  of  the  eigenvalues 
of  the  matrix  which  generates  the  quadratic  surface.  Since  volume  has  little 
intuitive  significance  in  spaces  of  dimension  greater  than  3,  we  omit  the  con- 
stant of  proportionality  and  the  volume  is  taken  to  be  simply 


(10) 


To  apply  this  result  to  the  case  at  hand,  first  substitute  (8)  into  (7)  to 
obtain  the  equation  of  the  ellipsoidal  surface  in  the  space  of  z: 

1 = z^  (FJ(T)F^'^)  z (11) 

T 

Vg  is  then  given  by  (10)  where  A.  are  the  eigenvalues  of  (FJ(T)F  ) . , More 

simply,  if  denotes  the  eigenvalues  of  FJ(T)f'^,  the  ellipsoidal  volume  is  also 
given  by 


n 

V = ir  /ui  (12) 

hi 

i=i 


and  the  spherical  volume  is  the  shortest  distance  to  the  surface,  1/ » to 
the  nth  power,  or  alternatively. 


V 


S 


(13) 


The  remaining  issue  is  computation  of  J(T).  It  is  always  possible  to 
integrate  (6)  for  an  interval  T starting  from  a zero  initial  condition.  But  in 
Ref.  3 an  analytic  solution  is  given  for  the  case  of  flexible  space  structure 
dynamics  expressed  in  decoupled  modal  form.  Since  it  is  always  possible  to  ex- 
press the  dynamics  in  this  form  it  is  always  possible  to  compute  the  Degree  of 
Observability,  DO,  by  evaluating  closed-form  expressions  and  solving  for  the 
eigenvalues  of  a real,  symmetric  matrix. 


RECOGNITION  OF  COMPONENT  FAILURES 

Because  of  the  realistic  possibility  of  components  failing  during  the 
operating  lifetime  of  the  system,  one  would  like  the  Degree  of  Observability  to 
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be  averaged  in  some  way  over  the  set  of  component  failure  combinations  which 
the  system  may  experience.  To  this  end,  let  f be  an  indicator  of  the  state  of 
failures  of  the  components,  and  let  the  vector  ^ represent  their  locations. 

Then  for  a given  set  of  operating  sensors,,  one  can  compute  the  Degree  of  Obser- 
vability, DO(£^',f),  using  the  method  previously  described. 

The  component  locations  indicated  by  ^ are  deterministic;  they  will  subse- 
quently be  adjusted  to  optimize  the  Degree  of  Observability.  But  f is  a random 
variable  with  a time- dependent  probability  distribution.  Thus  DO(^,f)  is  also 
a random  variable  with  a time- dependent  probability  distribution  defined  by  the 
distribution  of  f.  To  define  a meaningful  deterministic  performance  measure, 
one  would  logically  use  the  expected  value  of  D0(_£, f)  with  the  expectation  taken 
over  the  distribution  of  f,  the  failure  state  for  the  system  components.  This 
yields  a performance  measure  which  depends  on  time,  t.  It  represents  a measure 
of  the  expected  performance  of  the  system  at  time  t in  view  of  the  probabilities 
of  the  various  failure  states  at  that  time. 

But  this  control  system  is  required  to  operate  over  a certain  period  Tj^ 
which  might  represent  the  time  between  maintenance  visits.  Rather  than  optimize 
the  Degree  of  Observability  at  any  one  time,  such  as  the  end  of  that  period,  it 
would  seem  more  meaningful  to  optimize  the  average  Degree  of  Observability  over 
the  whole  period.  • In  this  average,  the  performance  resulting  from  failure  states 
which  are  likely  over  longer  periods  would  be  weighted  more  heavily  than  those 
likely  to  exist  over  shorter  periods.  And  a probability-weighted  measure  of 
performance  over  the  whole  operating,  period  ,1s  obtained  rather  than  just  a 
measure  of  performance  at  one  time. 

The  average  of  the  expected  Degree  of  Observability  over  the  mission  period 

T is  taken  as  the  final  measure: 
m 


DO 


AVE 


T 

m 


D0(^,  f)  dt 


(14) 


But  the  expected  DO  is  simply  a weighted  sum  over  the  different  failure  states, 


DO(l,f)  = DO(f,f^)  P^(t) 
i 


(15) 


where  ?£(t)  Is  the  probability  of  failure  state  fj^  at  time  t. 
can  be  expressed  as 


do^ve^)  - 


2 DO{t,fj) 
i 


m 


/ 


m 


p^(t) 


dt 


The  final  measure 


(16) 


and  depends  on  Tj^  and  the  component  failure  statistics  as  well  as  the  locations. 
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To  illustrate  the  calculation  of  the  average  probabilities  for  the  failure 
states,  take  the  usual  assumptions  of  independence  of  component  failures  and 
the  exponential  distribution  of  time  to  failure  for  each  component . Then  for 
the  jth  component,  the  probability  that  it  is  working  at  time  t is 

-A . t 

P(jth  component  working  at  t)  = e ^ (17) 

where  is  the  failure  rate  for  this  component,  the  reciprocal  of  its  mean  time 
to  failure.  Let  the  ith  failure  state  be  characterized  by  two  sets  of  indices, 
Jy  and  Jp,  with  all  components  having  indices  j in  the  set  Jy  working  and  all 
components  having  indices  j in  the  set  Jp  failed.  Note  that  the  index  of  each 
component  in  the  system  must  be  contained  in  one  or  the  other  of  Jy  or  Jp,  but 
not  both.  Then  the  probability  of  this  failure  state  at  t is 


P^(t)  = 


1 

1 

J 

p “X  , t -1 

3 

2 

n e 

n (1  - e ) 

JeJp 

With  the  definition 


A 


W 


A. 

3 


this  can  be  written  as 


(18). 


(19) 


-At  -A.t 

P.  (t)  = e n (1  - e ^ ) (20) 

^ jeJp 

The  average,  over  the  mission  period  Tjy[,  of  this  probability^— as  is  required  for 
the  calculation  of  the  Degree  of  Observability  given  in  Eq.  (16)— can  be  ex- 
pressed as 


M 


M 

/ 


P.(t)  dt 
1 


2 (-1)^  Sum(k) 

k=0 


(21) 


where 


Sum(k.)  = 2 


^=1  pj)T^ 


-(A  + 2 X.)T  . 

w j M 


(22) 


NF 

" k!  (N  -k)! 


N = the  number  of  elements  in  J (the  number  of  failed  components) 

r r 


2 A . = for  each  I,  the  sum  of  a different  combination  of  k A . with  ieJ„ 
r .1  1 -J  F 


k 


The  first  term  in  the  sum  of  Eq.  (21)  requires  interpretation  in  the  case  of  no 
working  components.  In  the  usual  case  with  some  components  working,  A^  given 
by  Eq.  (19)  is  greater  than  zero  and 


Sum(O)  = 


^w'^M 


(1 


(23) 


If  there  are  no  working  components,  define  A = 0,  and  Sum(O)  = 1. 

w 

These  expressions  can  be  simplified  in  the  special  case  of  all  component 
failure  rates  equal.  Call  the  number  of  working  components  Ny  and  the  number 
of  failed  components  Np  as  before.  Then  if  all  A.  = A, 


P^(t)dt 


^'F 

= 2 

k=0 


k NP 

<k> 

% + k)AT^ 


1 - e 


As  before,  if 


0,  the  term  corresponding  to  k = 0 is  1. 


(24) 


OPTIMUM  SENSOR  PLACEMENT 

Having  a computable  measure  of  how  well  the  structure  can  be  observed  with 
any  given  set  of  sensors,  with  the  expected  effect  of  component  failures 
throughout  the  mission  reflected  in  the  measure,  one  can  then  seek  to  optimize 
the  choice  of  component  locations,  for  a given  number,  so  as  to  maximize  the 
performance  measure.  This  task  may  be  computationally  burdensome  when  dealing 
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with  a large  number  of  components  but  it  is  conceptually  straightforward. 

A constraint  which  will  likely  apply  in  most  applications  is  that  sensor 
placement  will  be  restricted  to  a discrete  set  of  permissible  locations.  Struc- 
tural considerations,  for  example,  may  require  that  instrument  packages  be 
mounted  only  at  the  joints  of  a truss  structure.  If  this  is  true  of  all  the 
components,  then  the  placement  optimization  problem  is  in  the  nature  of  an  inte- 
ger programming  problem.  Many  algorithms  have  been  described  in  the  literature 
for  solving  integer  programming  problems;  nothing  has  been  added  to  that  art  in 
this  work.  The  examples  which  follow  are  intended  only  to  illustrate  the  nature 
of  this  step.  They  were  restricted  to  a small  number  of  components  and  optimi- 
zation was  accomplished  by  global  search — by  testing  all  admissible  combinations 
of  component  locations. 


CHOICE  OF  COMPONENT  NUMBER 

Having  the  optimum  set  of  sensor  locations  and  the  corresponding  maximum 
Degree  of  Observability  for  a given  number  of  components,  one  can  compute  this 
maximum  performance  measure  for  several  choices  of  component  number.  The 
choice  of  how  many  sensors  to  use  in  the  system  cannot  be  resolved  as  an 
optimization  problem  unless  additional  factors  are  incorporated  in  the  cri- 
terion. The  Degree  of  Observability  will  always  improve  with  additional  sen- 
sors if  the  best  locations  are  used  in  each  case. 

However,  it  should  be  informative  to  observe  the  trend  of  the  performance 
measure  with  number  of  components.  Some  locations  are  more  advantageous  than 
others — such  as  the  placement  of  rate  gyros  near  the  nodes  of  important  modes. 
With  the  realistic  restriction  that  only  one  component  can  be  placed  at  any 
one  of  the  allowable  locations,  one  should  expect  to  see  diminishing  returns  , 
in  performance  with  Increasing  number  as  the  more  favorable  locations  are 
occupied.  This  Information  should  be  helpful  to  the  designer  in  making  the 
trade-off  between  Improved  performance  and  increased  cost,  power  required,  etc. 


APPLICATION  TO  BEAM 

To  Illustrate  the  methodology  defined  above,  sensor  placement  and  number 
were  considered  for  the  case  of  a free-free  beam.  The  beam  was  modeled  as  in 
Reference  3 with  the  states  representing  the  modal  amplitudes  and  rates  of 
the  first  three  flex  modes;  translation  rate  sensors  were  used  for  observation. 
In  all  trials  the  amplitude  rate  states  were  scaled  by  the  factor  top  relative 
to  the  amplitude  states  where  is  the  corresponding  modal  frequency.  The 
sensors  were  assumed  equally  noisy  (N  = I) . As  an  aid  to  interpretation  of 
the  results  that  follow,  the  mode  shapes  for  the  three  modeled  modes  are  shown 
in  Figure  2. 

The  Degree  of  Observability  as  a function  of  the  position  of  a single 
translation  rate  sensor  along  the  length  of  the  beam  is  shown  in  Figure  3 with 
the  beam  modeled  by  the  dynamics. of  the  first  mode  only.  This  case  is  shown 
to  demonstrate  that  the  computed  DO  gives  the  result  that  is  intuitively  ob- 
vious in  this  instance— the  DO  reflects  the  shape  of  the  first  mcie  and  is  zero 
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when  the  sensor  is  located  at  either  node  because  the  mode  is  unobservable  by 
this  type  of  sensor  placed  at  a node.  The  use  of  the  observability  measure 
will  be  more  important  when  dealing  with  more  complex  systems  where  the  results 
will  not  be  so  predictable.  A suggestion  of  this  is  given  in  Figure  4 which 
shows  the  DO  as  a function  of  the  location  of  a single  sensor  with  the  beam 
modeled  by  the  dynamics  of  the  first  three  flex  modes.  In  this  case  the  pat- 
tern is  more  complex  with  the  measure  going  to  zero  at  each  node  of  any  mode — 
because  unobservability  of  any  decoupled  mode  constitutes  unobservability  of 
the  system.  No  sensor  failures  have  been  considered  in  these  results. 

The  Degree  of  Observability  achieved  with  two  sensors  is  shown  in  Figure  5 
with  the  beam,  represented  by  the  dynamics  of  the  first  two  flex  modes.  In  this 
case  one  sensor  is  fixed  at  the  end  of  the  beam  and  DO  is  shown  as  a function 
of  the  location  of  the  other  sensor.  At  no  point  does  DO  become  zero  because 
both  modes  are  observable  by  the  sensor  at  the  end.  The  observability  measure 
is  dominated  by  the  observability  of  the  first  mode  as  is  indicated  by  the 
similarity  of  this  curve  to  that  of  Figure  3 for  first  mode  dynamics  only. 

The  effect  of  possible  sensor  failures  on  the  observability  measure  is 
indicated  in  Figure  6 which  shows  the  Degree  of  Observability  for  a two-mode 
representation  of  the  dynamics  of  the  beam  as  a function  of  the  location  of  one 
sensor  with  another  fixed  at  the  end  of  the  beam.  The  upper  curve  is  the  same 
as  Figure  5 in  which  no  failures  are  accounted  for.  The  lower  curve  in  Figure 
6 is  the  average  expected  DO  with  the  mean  time  to  failure  for  each  sensor 
equal  to  the  mission  period.  This  gives  a probability  of  0.63  that  each  sensor 
will  fall  before  the  end  of  the  mission.  This- system  is  unobservable  if  both 
sensors  fail  or  if  one  sensor  is  located  at  a node  of  either  mode  and  the  other 
sensor  falls.  These  possibilities  are  accounted  for  with  the  proper  probabili- 
ties in  the  measure  of  observability. 

As  in  Figure  5,  the  DO  in  Figure  6 is  dominated  by  the  observability  of  the 

first  mode.  In  this  case  the  maximum  tolerable  error  in  the  est.imate  of  both 

modal  amplitudes  was  taken  equal  (ei  = eo  ) . This  characteristic  is 

■^max  ^max 

changed  if  the -relative  importance  of  the  two  modes  is  different.  This  is  il- 
lustrated in  Figure  7 which  is  the  same  case  as  in  Figure  6 except  that  the 
second  mode  has  been  declared  5 times  more  sensitive  than  the  first  mode 

(e']  = 1/5  ei  ).  The  relative  weighting  of  each  rate  state  to  the  corre- 

■^max  -‘•max 

spending  amplitude  state  has  been  retained  as  the  modal  frequency,  as  be- 

fore. In  this  figure,  the  DO  as  a function  of  the  location  of  the  variable 
sensor  has  essentially  the  character  of  the  magnitude  of  the  second  mode  shape. 
This  is  because  of  the  greater  Importance  that  has  been  assigned  to  that  mode. 

Figure  7 also  illustrates  a very  Important  point  related  to  the  Importance 
of  accounting  for  the  possibility  of  component  failures  early  in  the  design 
process.  That  point  is  the  fact  that  recognition  of  the  likely  effects  of  com- 
ponent failures  can  alter  the  optimal  locations  for  those  components.  The 
quantitative  signlf icance ; of . this  point  is  small  in  this  example  but  the  quali- 
tative truth  of  the  proposition  is  established.  Looking  at  the. upper  curve  in 
Figure  7 one  can  see  that  if  the  possibility  of  sensor  failures  is  Ignored,  the 
Degree  of  Observability  is  nearly  the  same  with  one  sensor  at  the  end  of  the 
beam  and  a second  sensor  located  either  at  test  position  2 or  7— actually,  the 
observability  is  slightly  better  at  position  7.  So  without  considering  failures 
one  would  say  that  the  optimum  locationa  for  two  sensors  on  the  beam  are  at 
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test  positions  1 and  7.  But  from  the  lower  curve  which  accounts  for  the  likely 
failures  it  is  clear  that  locations  1 and  2 are  optimal  in  that  case.  In  draw- 
ing these  conclusions  we  hold  the  constraint  that  only  one  sensor  can  be  al- 
lowed at  one  location  and  only  the  test  positions  shown  in  Figure  2 are  per- 
mitted. In  particular,  we  search  only  half  the  length  of  the  beam;  sensors 
located  on  the  other  half  of  the  beam  would  give  symmetric  results. 

The  interpretation  of  the  results  on  optimum  sensor  placement  is  that  po- 
sition 7 is  near  the  antinode  of  mode  2 and  so  is  a good  location  for  observing 
that  mode  which  has  been  given  great  importance  in  this  example.  But  position 
7 is  near  the  node  of  mode  1 and  so  is  not  a good  choice  of  location  for  observ- 
ing that  mode.  If  no  failures  are  admitted,  the  optimal  sensor  locations  rep- 
resent the  risky  choice  of  placing  one  sensor  at  location  7 to  gain  good  ob- 
servability of  mode  2 because  mode  1 is  well  observed  by  the  sensor  at  position 
1,  the  end  of  the  beam.  But  if  failures  are  anticipated,  the  failure  of  the 
sensor  at  position  1 leaves  mode  1 so  poorly  observed  by  a sensor  at  position  7 
that  the  more  conservative  choice  of  placing  the  second  sensor  at  position  2 is 
preferable.  At  position  2,  the  sensor  does  not  observe  the  second  mode  quite 
as  well,  but  has  better  observability  of  mode  1 compared  with  a sensor  at  posi- 
tion 7. 

Finally,  the  variation  of  Degree  of  Observability  with  optimized  sensor 
locations  as  the  number  of  sensors  is  Increased  is  shown  in  Figure  8.  In  gene- 
ral one  would  expect  to  see  diminishing  returns  of  observability,  with  more  sen- 
sors as  the  more  favorable  locations  are  occupied.  The  designer  must  then  base 
his  choice  of  number  of  sensors  to  use  on  the  trade-off  of  increased  performance 
against  Increased  cost,  weight,  power,  etc.  In  this  case  diminishing  returns 
are  not  evident  in  Figure  8 because  there  are  several  locations  which  are  near- 
ly equally  effective  after  the  position  at  the  end  of  the  beam  has  been  occupied. 
If  the  curve  were  extended  to  a larger  number  of  sensors  it  would  surely  begin 
to  flatten  out. 


CONCLUSIONS 

A methodology  has  been  presented  which  is  intended  to  assist  the  designer 
of  a control  system  for  a large  space  structure  to  decide  how  many  sensors 
should  be  incorporated  in  the  system  and  where  they  should  be  placed  on  the 
structure.  This  approach  is  intended  to  be  especially  useful  in  the  early 
stages  of  the  evolution  of  the  system,  before  a complete  control  system  con- 
cept has  been  defined.  This  methodology  is  based  on  a quantitative  measure  of 
the  observability  of  the  system  for  a given  set  of  sensors.  The  effect  of  pos- 
sible component  failures  during  the  mission  period  was  Incorporated  in  the 
measure.  The  question  of  sensor  placement  is  then  resolved  by  finding  the  lo- 
cations which  maximize  the  performance  measure.  The  number  of  components  to 
use  cannot  be  determined  by  optimizing  the  measure  because  the  observability 
always  improves  with  Increased  number  of  sensors  if  they  are  optimally  located. 
However,  the  improvement  in  the  measure  with  component  number  can  be  deter- 
mined, and  this  information  can  be  used  along  with  data  on  cost,  power  required, 
etc.  to  decide  how  many  sensors  to  use. 
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FIGURES 


Figures  la-c;  Volume  weighting  scheme  for  computing  degree  of 

observability  - (a)  ideal  distribution,  (b)  slightly  distorted 
distribution,  (c)  very  distorted  distribution.  Shading  indicates 
relative  weighting. 
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Figure  5.  Degree  of  Observability  with  Two  Sensors. 


Figure  6 
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Figure  7.  Degree  of  Observability  with  Two  Sensors. 
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Figure  8.  Maximum  DO  vs.  Number  of  Sensors. 
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THE  TOYSAT  STRUCTURAL  CONTROL  EXPERIMENT 

J.A.  Breakwdi  and  G.J.  Chambers 

Lockheed  Missiles  and  Space  Company 
Sunnyvale,  CA  94304 


ABSTRACT/FOREWORD 

This  paper  describes  in  detail  the  Lockheed  TOYSAT  experiment.  The  experiment 
was  designed  to  test  hypothesis  concerning  the  application  of  optimal  control 
theory  to  flexible  spacecraft.  The  theory  is  presented,  and  results  described. 

Hardware  Setup.  A drawing  of  the  TOYSAT  experimental  setup  is  shown  in  Fig.  1. 
The  test  setup  consists  of  a 12-ft.  flexible  beam  fastened  to  the  side  of  a 
1.3-ft.  square  block  of  aluminum.  To  accentuate  bending,  a 2-lb.  weight  is 
attached  to  each  end  of  the  beam.  The  beam  is  suspended  from  the  ceiling  in 
such  a way  as  to  allow  free  motion  in  the  horizontal  plane. 

Control  of  the  motion  of  the  specimen  is  provided  by  two  linear  actuators. 

When  the  actuators  act  together,  translational  motion  results,  and  when  equal 
but  opposite  commands  are  given,  pure  rotational  motion  results.  Sensing  is 
provided  by  accelerometers  mounted  at  the  ends  of  the  flexible  beam  and  by 
linear  position  sensors  mounted  in  tandem  with  the  linear  actuators.  The 
average  of  the  two  accelerometer  measurements  indicates  the  translational 
motion,  and  their  difference  measures  the  rotational  motion.  The  same  holds 
true  for  the  position  sensors.  The  separation  of  rotation  and  translation  in 
both  controller  and  sensor,  as  well  as  in  the  physical  behavior  of  the  TOYSAT, 
allows  independent  and  separate  design  of  translational  and  rotational  control 
1 aws . 


Figure  2 shows  how  the  TOYSAT  is  connected  by  way  of  A/D  and  D/A  converters  to 
a computer  where  the  digital  control  law  is  implemented.  The  large  number  of 
computations  necessary  to  carry  out  the  control  law  is  handled  by  an  array 
processor  which  allows  a sampling  rate  of  80  Hz. 

The  frequencies,  dampings,  and  mode  shapes  for  the  assembled  system  are  shown 
in  Table  1.  These  data  result  from  a judicious  combination  of  analysis  and 
testing.  The  rigid  body  modes  and  the  first  two  bending  modes  in  both  trans- 
lation and  rotation  are  used  in  controls  synthesis  and  implementation.  The 
remaining  modes  are  used  to  evaluate  the  effect  of  spillover.  The  frequencies 
and  dampings  (ui,  are  assembled  in  the  usual  block  diagonal  form  to 
give  the  F matrix,  t!ie  ni  give  the  G matrix,  and  the  (o)  and  (^) 
give  the  H^  and  H2  matrix  components  corresponding  to  the  linear  sensors 
and  accelerometers  respectively.  (These  matrices  are  defined  by  their  use  in 
Eq.  2). 


Controls  Synthesis.  The  controller  implemented  in  the  hardware  has  the  form: 


A 

X 


n+1 


4 '5^  * r'u^  * Kz„ 


Cx 


n’ 


(1) 
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Fig.  1 TOYSAT  Test  Setup 
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Fig.  2 Measurement  and  Control  Flow  Diagram 
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Table  1 TOYSAT  MODAL  FREQUENCIES,  DAMPINGS,  MODE  SHAPES, 
AND  INFLUENCE  COEFFICIENTS 


0).  (Hz) 

^ (§) 

(6^-  (0) 

(^) 

Trans 

R.B. 

1 

2 

3 

4 

5 

6 

0.19 

0.98 

9.06 

28.02 

57.69 

98.15 

149.38 

1 

1.0 

0.0842 

0.0365 

0.0210 

0.0147 

0.0113 

0.0092 

1.0 

-0.952 

0.365 

-0.229 

0.0164 

-0.128 

0.105 

3.24 

4.21 

1.83 

1.05 

0.735 

0.565 

0.460 

RT 

R.B. 

1 

2 

3 

4 

5 

6 

0.0 

2.64 

9.87 

28.4 

57.94 

98.33 

149.53 

1 

1.0 

0.252 

0.127 

0.055 

0.033 

0.022 

0.017 

I 

0.00712 

1.54 

0.774 

0.335 

0.201 

0.134 

0.104 

Total  Mass 
Total  Inertia  = 

2.708  slugs 
140.37  in.-lb-s2 

*An  approximation  for  a phenomenon  that  cannot  be 
accounted  for  by  modal  damping. 

where  denotes  the  transition  matrix,  and  r‘ , K,  C are  constant  matrices  to 
be  determined.  This  deceptively  simple  form  conceals  a multitude  of  com- 
plexities involved  in  choosing  the  gains  K and  C.  The  controller  separates 
naturally  into  a part  which  estimates  the  modal  amplitudes  and  another  part 
which  outputs  these  estimates,  multiplied  by  some  gain,  to  the  control 
actuators.  In  accordance  with  this  separation  we  will  present  first  the 
estimator  synthesis  and  then  the  control  gains  synthesis. 

Filter  Synthesis.  The  problem  is  defined  in  the  continuous  domain  by  Eq.  2 
below^i  (The  state  covariance  is  specified  in  the  continuous  domain). 


X = Fx  + Gu  + w^ 

z = H.x  + H5X  + • _ 
12c 


(2) 
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W(-  is  white  noise,  zero  mean,  covariance  Q^.; 

Vc  is  white  noise,  zero  mean,  covariance  R^. 

The  extra  term  H2  is  necessary  because  the  standard  form  does  not  allow  for 
acceleration  measurements.  The  specific  entries  in  the  matrices  for  the 
separate  translation  and  rotation  plants  are  given  in  Table  1. 

The  first  step  in  the  synthesis  process  is  to  convert  the  continuous  system 
given  in  Eg.  2 to  a discrete  system. 


z = X + H^x^  + V 
n 1 n 2 n 


(3) 


W(j  is  zero  mean  white  noise,  covariance  Qj; 

V is  zero  mean  white  noise,  covariance  R. 

T 

^d  = MVdl=  I ^(t)  Qc^'(t)  dt  (4) 

0 


i and  A are  derived  from  F and  16  using  standard  numerical  matrix  exponential 
procedures.  T is  the  step  length.  The  covariance  of  the  discrete  noise  R is 
taken  as  an  input  and  is  not  derived  from  the  continuous  measurement  noise 
R(..  Eq.  3 is  put  into  standard  Kalman  filter  form  in  two  steps.  First  x^ 
equation  by  means  of  Eq.  2,  leaving 


= ^l"n 


" H2FXn 


H2GUn 


H2W^ 


(5) 


Then  to  the  first  of  Eq.  3 we  add  the  identically  zero  quantity 

L K - "I’^n  - 

- HjGu^  - - vj  (6) 

where  L is  to  be  determined  later.  The  result  is 

Vl  - [*-  - LH/]  x„ 

.*  [r-LH2G)u„*Lz„ 

+ Wj  - LH2W^  - Lv.  (7) 
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Equations  5 and  7 together  constitute  the  usual  Kalman  filter  problem  if  L is 
chosen  so  that  the  measurement  noise  (H2Wc  + v)  and  the  state  noise 
(W(j  - LH2Wc  - Lv)  are  uncorrelated.  Equation  8 shows  the  choice  of  L 
that  accomplishes  this. 

L = #(T)  A ^ 

Standard  optimal  techniques  may  now  be  used  to  obtain  the  filter  gains  K.  The 
resulting  filter  has  the  form 

Xn+i  = (L  + K)(Hj^  + H2F)]  . 

+ [r-  (L  + K)  H2G]  + (L  + K)  (9) 

and  r' shown  in  Eq.  1 are  just  the  appropriate  coefficients  of  Xj,  and  u^^ 
from  Eq.  9. 

Control  Gains.  The  control  segment  of  the  problem  is  specified  by 

X = Fx  + Gu  (10) 

with  the  cost  functional 

00 

J = J (x^Ax  + u^Bu)  dt  . 

0 

Again,  as  with  the  filter,  this  problem  is  converted  to  a discrete  problem 

^n+1  = #x^  + ru^  , (11) 

but  now  the  cost  function  is  specified  by 

00 

- 2 (’'J  * “I  \ 2 “J  “’’n)- 

n=l 

where  the  weights  Q,  S,  R,  of  the  discrete  problem  are  given  by 

-T 

Q = j (t)  A $(t)  dt 
0 
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(t)  A $(t)  dt 


je  + r'*'  (t)  A r(t)j  dt  (13) 


Equations  11  and  12  specify  a standard  discrete  optimal  control  problem  that  is 
easily  solved  by  standard  methods.  The  resulting  gains  C are  those  implemented 
in  the  controller  specified  by  Eq.  1. 

Evaluation  of  Controller.  When  the  six-state  controller  for  either  translation 
or  rotation  is  completed,  it  is  tested  on  an  augmented  fourteen-state 
evaluation  model.  The  eigenvalues  of  the  six-state  controller  operating  on  the 
fourteen-state  plant  are  investigated  for  signs  of  spillover.  These 
eigenvalues  are  those  of  the  matrix  shown  in  Fig.  3 The  superscripts  14  in 
this  figure  refer  to  the  fourteen-state  evaluation  model. 

Specific  Controls  Design.  The  actual  gain  selection  procedure,  like  the 
theory,  divides  into  separate  estimator  and  control  gains  selections.  First  a 
filter  was  designed  and  tested  on  the  open-loop  system.  When  that  seemed  to  be 
working,  the  loop  was  closed  with  an  appropriate  set  of  control  gains. 

To  obtain  filter  gains,  the  parameters  to  be  selected  were  the  continuous  state 
noise  covariance  matrix  Qc  (defined  in  Eq.  2)  a six-by-six  matrix  for  either 
translation  or  rotation,  and  the  state  noise  covariance  matrix  R (defined  in 
Eq.  3),  a two-by-two  matrix  assumed  to  be  diagonal. 

The  approach  to  choosing  and  R was  to  pick  so  as  to  maximize  the 
separation  of  the  estimator  poles.  When  done  correctly,  this  assures  that  the 
separate  states  in  the  Kalman  filter  contain  just  the  intended  modal  amplitude 
and  very  little  of  the  others.  That  is,  is  picked  to  minimize  crosstalk 
between  modes  in  the  Kalman  filter. 


#14 


c 


(K  + L)  [hJ^  + F^^] 


# - (K  + L)  [h^  + F + (K  + L)  g]  C 


+ (K  + L)  C 


Fig.  3 Matrix  For  Eigenvalues  of  Six-State  Controller 
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Figure  4 shows  the  time  history  of  the  estimates  of  the  state  vector  for  a good 
choice  of  Q^..  The  specimen  had  been  excited  by  a four-second  chirp  (a  fast 
sine  sweep  varying  linearly  in  frequency),  and  the  outputs  of  the  estimator 
were  recordcid.  The  rigid  body  channels  contain  all  the  low  frequency  movement, 
while  the  modal  channels  separate  the  remaining  frequencies  nicely. 

The  measurement  covariance  matrix  R was  chosen  to  force  good  convergence  of 
modal  reconstruction  of  the  measurements  with  the  actual  measurements.  That 
is,  individual  components  of  R are  increased  until  corresponding  components  of 
Hx  converge  with  the  actual  measurements  fairly  quickly.  Figures  5 and  6 show 
that  the  modal  reconstruction  of  the  rotational  displacement  and  tip 
acceleration  converge  quite  quickly  with  the  actual  measurements.  Again,  in 
this  case  the  specimen  had  been  perturbed  by  a four-second  chirp  and  then 
allowed  to  settle. 

Once  a filter  design  has  been  established,  the  selection  of  control  gains  is 
straightforward.  Since  there  was  no  obvious  physical  quantity  such  as  a line 
of  sight  on  the  TOYSAT,  the  control  weights  were  picked  to  give  fast  and 
roughly  equal  settling  times  for  all  modes.  The  values  for  the  covariance 
matrices  and  control  weighting  matrices  which  generated  the  nominal  controller 
are  shown  in  Table  2 for  the  translation  controller  and  in  Table  10  for  the 
rotation  controller.  Also  shown  are  the  resulting  filter  and  control  pole 
locations. 

TOYSAT  Experimental  Results.  The  hardware  tests  of  the  control  system 
described  in  the  previous  section  are  slight  variations  of  a standard  modal 
identification  tests.  A four-second  chirp  containing  frequencies  from  0 to 
40  Hz  is  applied  simultaneously  in  rotation  and  translation  through  the 
Electroseis  shakers.  This  open-loop  command  is  superimposed  upon  any  feedback 
commands  which  may  be  generated  by  the  control  law  given  in  Eq.  1.  The 
open-loop  chirp  command  is  also  communicated  to  the  estimator.  Data  are  taken 
for  twelve  seconds.  Since  the  chirp  begins  sometime  in  the  first  second,  this 
allows  for  roughly  eight  seconds  of  data  after  the  chirp  has  expired.  Data 
saved  during  the  test  run  include  the  open-  and  closed-loop  control  commands, 
all  four  measurements,  and  the  twelve  Kalman  filter  states  during  the  run. 
Post-experiment  processing  allows  analysis  of  the  data  in  the  frequency  domain 
as  well  as  in  the  time  domain. 

Figures  7 through  12  illustrate  various  forms  of  results  from  the  control 
system  test  described  in  Tables  2 and  3.  Figures  7 and  8 show  comparisons  of 
the  four  measurements  taken  during  two  separate  test  runs;  one  without  con- 
trols, and  one  with  the  control  system  loop  control.  The  closed-loop  data  show 
much  faster  settling  time  than  do  the  data  from  the  open-loop  test.  Also,  peak 
excursions  during  the  chirp  excitation  are  considerably  smaller.  Figure  9 
shows  the  Kalman  filter  states  during  the  closed-loop  test.  This  may  be 
compared  with  Fig.  4 which  describes  the  same  states  during  the  open-loop  test. 

Figures  10  and  11  present  frequency  domain  comparisons  of  open-  and  closed-loop 
transfer  functions  for  tip  accelerations  to  inputs  through  the  central  shakers. 
Figure  10  shows  that  the  translation  controller  reduces  the  response  of  the 
first  two  bending  modes  (at  1 and  9 Hz)  to  below  the  level  of  the  third 
unmodeled  bending  mode  (28  Hz).  The  first  mode  amplitude  is  reduced  by  a 
factor  of  eight,  a number  which  is  proportional  to  the  increase  in  damping. 
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CENTRAL  BODY  ROTATION  (r»d) 
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Fig.  4 Open-Loop  Kalman  Filter  Output  (Response  to  4-s  Chirp) 


Fig.  5 Measured  and  Predicted  Open-Loop  Central  Body  Rotation 
(Response  to  4-s  Chirp) 
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ROTATIONAL  TIP  ACCELERATION  (ih./> 


Fig.  6 Measured  and  Predicted  Open-Loop  Rotational  Tip  Acceleration 
(Response  to  4-s  Chirp) 

TRANSLATION  TIME  (•)  ROTATION  TIMEIi) 
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0 S to  0 $ 10 


TIME  (s)  TIME  (s) 

Fig.  7 Open-  and  Closed-Loop  Central  Body  Measurements 
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Fig.  8 Open-  and  Closed-Loop  Tip  Acceleration  Measurements 
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Fig.  9 Closed-Loop  Kalman  Filter  Output  (Response  to  4-s  Chirp) 
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Fig.  10  Open-  and  Closed-Loop  Transfer  Functions;  Translational  Tip 
Acceleration/ Input  Chirp 
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Fig.  11  Open-  and  Closed-Loop  Transfer  Functions;  Rotational  Tip 
Acceleration/ Input  Chirp 
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TEST:  A 

PREDICTION  :o 


Fig.  12  Closed-Loop  Pole  Location;  Predicted  Versus  Test 


The  damping  was  measured  by  fitting  second  order  models  to  the  transfer 
function.  The  closed-loop  was  roughly  4 percent  for  the  first  mode  (versus 
0.29  percent  open-loop)  and  11  percent  for  the  second  mode  (versus  0.54  percent 
open-loop).  For  the  rotation  controller,  the  first  and  second  modes  (2.6  and 
9.8  Hz)  were  also  reduced  below  the  level  of  the  third  mode  (28  Hz).  Again, 
the  ratio  of  the  reduced  amplitude  was  proportional  to  the  increase  in 
oamping.  The  first  mode  closed-loop  damping  was  roughly  6 percent  (versus  0.33 
percent  open-loop),  and  the  second  mode  was  12  percent  (versus  0.3  percent 
open-loop) . 

The  damping  measured  from  test  data  does  not  agree  with  the  analytically 
predicted  pole  locations  shown  in  Tables  2 and  3.  The  predictions  for  the 
second  mode  behaviour  were  reasonably  close,  but  the  first  mode  predictions 
were  quite  inaccurate.  These  differences  are  dramatically  illustrated  by  the 
charts  in  Fig.  12. 

Figure  13  shows  the  result  of  a deliberate  attempt  to  provoke  the  phenomena 
commonly  known  as  spillover.  A new  control  system  was  generated  by  reducing 
the  weight  on  the  translation  control  by  a factor  of  one  hundred  from  that 
given  in  Table  2.  The  analysis  predicted  that  this  controller  would  cause  the 
first  unmodeled  mode  (at  29  Hz)  to  go  unstable.  Test  results  given  in  Fig.  13 
demonstrate  that  indeed  something  did  go  unstable.  The  frequency  domain 
transfer  function  in  Fig.  14  confirms  that  it  was  the  29  Hz  mode  which 
dominated  the  instability. 

Finally,  two  quick  test  runs  were  made,  more  as  demonstrations  to  suggest 
further  work  than  to  prove  any  point  in  themselves.  The  first  was  a 
demonstration  of  a low  sample  rate  Kalman  filter;  it  ran  at  20  Hz  while 
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Table  2 80  Hz  CONTROLLER  DESIGN  PARAMETERS 


1 0 -2 

0 

-3.7 

0 

0 0 

0 

0 

0 

A = 3.8 

0 

-0.71 

0 

0 

0 

0 

833 

0 

TRANSLATION 

B = 3 X 10-3 

Qc  = Diag.  (0.5  x 

ig-3,  0, 

0.02,  3 X 

10-5 

R = Diag.  (8  x 20 

-5,  8 X 

10-1) 

CONTROL  POLES 

FILTER 

POLES 

-4.2  ± i 0.808 

5.2  ± 

i 5.3 

-5.1  ± i 9.1 

-9.8  ± 

i 5.8 

-6.7  ± i 56.7 

-28.7  ± 

i 50.2 

Table  3 80  Hz  CONTROLLER  DESIGN  PARAMETERS 


ROTATION 

718 

A = C .22 

0 

12 

B = 1 X 10-5 


Qc  - 

R = 


Diag.  (0,  1.0  x 10-3,  q,  1.0,  0,  3 x 10"3) 
Diag.  (5  x lO”'^,  5 x 10-^ 


CONTROL  POLES 

-4.5  ± i 4.9 
-6.4  ± i 17.1 
-5.7  ± i 60.1 


FILTER  POLES 
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-2.4  ± i 2.5 
-12.3  ± i 10.4 
-33.8  ± i 54.9 


estimating  behavior  up  to  10  Hz.  The  results  were  not  spectacular,  but  they 
are  shown  in  Fig.  15.  It  shows  a comparison  of  the  filter  prediction  (z) 
with  the  actual  measurement  of  the  central  body  rotation.  Theory  suggests  that 
the  filters  should  work  at  even  lower  frequencies,  but  we  were  unable  to  make 
this  happen  in  the  test. 

The  last  figure.  Fig.  16  shows  a test  devised  in  response  to  criticisms 
claiming  that  the  optimal  control  theory  is  too  sensitive  to  knowledge  of  the 
system  parameters.  A two  pound  weight  was  removed  from  one  of  the  tips  of  the 
flexible  beam.  This  destroyed  the  symmetry  of  the  TOYSAT,  and  changed  its 
rotational  inertia  by  70  percent.  Yet  the  control  system  designed  to  work  on 
the  original  system  did  not  do  badly,  as  the  figure  shows.  Admittedly,  this  is 
not  a rigorous  test  of  robustness,  and  there  are  some  parameters  that  the 
control  system  must  know  very  accurately  in  order  to  function  properly. 

However,  it  is  a dramatic  exhibition  of  the  control  system  working  despite  a 
massive  perturbation. 


Fig.  15  Measured  and  Predicted  Open-Loop  Central  Body  Rotation  for  a 20  Hz 
Sample  Rate  Kalman  Filter 
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Fig.  16  Measured  Tip  Acceleration  for  Two  Closed-Loop  Tests,  One  With 
Nominal  Configuration,  One  With  One  2-lb  Tip  Mass  Removed 


Conclusion.  Long  after  concluding  the  testing,  we  found  an  analytical 
"oversight"  which  could  very  well  have  caused  the  discrepancy  between  test  and 
prediction  of  the  lowest  bending  mode  (Fig.  12).  Not  mentioned  in  the  filter 
synthesis  was  what  we  used  as  noise  statistics  to  get  the  gains  for  the  filter 
is  Eq.  7.  It's  rather  straight  forward  to  obtain  the  state  covariance  as 

Qd  - ^(t)  0^  hJ  l'*'  - L Qj,  0^(T)  + L Qj,  hJ  l'''  + LRl’’’ 

However  the  measurement  covariance  should  be  taken  as 

i (Hj  Q hJ  * R^) 

where  At  is  the  sample  time. 

We  hao  carelessly  taken  it  as 

^ Q hJ  + R 

until  Herb  Rauch  corrected  our  error. 
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The  result  of  choosing  the  latter  value  is  that  the  lower  frequency  modes  will 
rely  excessively  on  the  accelerometer  measurement  to  the  exclusion  of  the 
position  measurements.  Thus,  two  integration  constants  are  lost  (or  two  poles 
are  trappeo  at  the  origin).  This  is  particularly  disastrous  for  the  lower 
frequency  modes,  and  could  easily  have  caused  the  discrepancy  between  the 
predicted  lamping  result  and  that  obtained  by  test.  A further  correction  is 
that  the  right  hano  side  of  Eq.  8 should  be  preceded  by  a 1/2.  But  this  has 
little  effect  on  the  outcome. 
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ABSTRACT 

A model  reduction  procedure  based  on  aggregation  with  respect  to  sensor 
and  actuator  influences  rather  than  modes  is  presented  for  large  systems  of 
coupled  second-order  differential  equations.  Perturbation  expressions  which 
can  predict  the  effects  of  spillover  on  both  the  aggregated  and  residual 
states  are  derived.  These  expressions  lead  to  the  development  of  control 
system  design  constraints  which  are  sufficient  to  guarantee,  to  within  the 
validity  of  the  perturbations,  that  the  residual  states  are  not  destabilized  by 
control  systems  designed  from  the  reduced  model.  A numerical  example  is  pro- 
vided to  illustrate  the  application  of  the  aggregation  and  control  system 
design  method. 


INTRODUCTION 

The  model  reduction  issue  for  large  space  structures  is  particularly 
acute  because  model  reduction  is  essential  for  easy  finite-dimensional  control 
system  design,  yet  its  use  introduces  the  possibility  of  Inadvertent  system 
destabilization.  Ordinary  methods  of  model  truncation  result  in  control  energy 
being  coupled  to  the  neglected  system  through  spillover.  This  makes  a priori 
performance  estimation  difficult.  Furthermore,  the  control  gains  enter  the 
coupling  coefficients  so  that  the  control  system  design  and  evaluation  processes 
must  be  iterated.  By  contrast,  the  model  reduction  method  presented  here,  based 
on  aggregation  with  respect  to  sensor  and  actuator  influence  functions  rather 
than  the  system  dynamic  matrix,  results  in  spillover  coupling  that  is  indepen- 
dent of  the  control  system.  As  a consequence,  the  effect  of  spillover  can  be 
more  readily  predicted  and  is  done  here  using  perturbation  theory. 

Aggregation  with  respect  to  sensor  and  actuator  Influence  functions  was 
developed  for  first-order  systems  by  Johnson  and  Lin  [IJ.  In  this  paper,  the 
method  is  applied  to  systems  in  second-order  form  which  are  representative  of 
large  space  structures.  The  use  of  perturbation  theory  to  estimate  pole  shifts 
due  to  spillover  has  been  developed  by  Balas  [2]  and  Lang  [3].  However,  their 
expressions  involve  the  controller  gains  in  a way  which  is  more  difficult  to  use 
for  design  purposes  than  the  expressions  presented  here.  Aubrun  [4]  applied 
perturbation  theory  to  evaluate  the  effects  of  small  control  gains  on  stability. 
In  the  limit  of  small  control  gains  and  small  spillover,  his  approximation 
agrees  with  the  one  given  in  this  paper. 

The  present  model  reduction  procedure  has  numerical  advantages.  Further, 
it  allows  the  perturbation  results  to  be  extended  in  special  cases  to  yield 
control  system  design  constraints  sufficient  to  guarantee,  to  within  the 
validity  of  the  perturbations,  the  non-destabilization  of  the  residual  system. 
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Accordingly,  a control  system  should  be  designed  to  produce  acceptable  aggre- 
gated system  behavior,  and  to  satisfy  the  residual-system  stability  conditions. 
The  design  of  such  control  systems  is  more  complex.  The  compensating  advantage 
is  that  more  reliable  performance  estimates  can  be  obtained. 

AGGREGATION  OF  LSS  MODELS 

In  modal  analysis,  the  dynamic  model  for  many  large  space  structures 
(LSSs)  can  be  written  as 

n + pn  + - D p = Bu 

' e 

y = Cp  + Dp 

where  p is  an  n -vector  of  modal  deflections,  u is  an  n -vector  of  control 
p ’ u 

forces  and  y is  an  n -vector  of  modal  deflections  and  velocities.  The  matrices 

y 

p and  are  assumed  to  satisfy 


(1) 

(2) 


p = diag[3,  • • • * ,3]  ; = diag[oj^,  • • • • ,0)^  ] (3) 

n 

where  3 is  a uniform  modal  damping  coefficient,  and  co^  > 0,  i = l,***,n^  are  the 

squared  modal  frequencies.  The  matrix  D^  represents  the  destabilizing  forces  in 

the  system,  and  is  assumed  here  tO  be  of  the  form  D^  = diag [d , • • • ,d] . A general 

D^  can  actually  be  put  in  a form  readily  treated  with  the  present  analysis.  The 

ability  to  handle  open-loop  unstable  systems  is  important  to  certain  LSSs  [5] . 
The  open  loop  poles  of  the  system  (1)  are 


+ 


J 


+ d - oj. 


i = 1, 


Let 


c . + C 


C‘B  = 0 

D*B  = 0,  D*C^  = 0 


(4) 


(5) 

(6) 


T T ~ 

be  projection  decompositions  of  C along  B , and  D along  B and  C,  where  a super- 
script T denotes  algebraic  transposition.  Assuming  2 and  D are  of  full  rank, 
the  partitioned  matrix 
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-T  ~T 
[B  D ] 


~T 

B C D 

a a a 

cxT  ~T 
B C D 

r r r 


can  be  formed  where  E is  n x n and  E is  n x n , n = n + 2n  , and  n = 

a aa  r rra-n  y r 

n - n . If  C and  D are  not  of  full  rank,  only  their  significant  and  indepen- 
T|  a 

dent  rows  are  retained  and  n^  can  be  reduced.  C represents  the  component  of 

displacement  measurement  which  is  not  collocated  with  the  actuators,  and  D 
represents  the  component  of  velocity  measurement  which  is  not  collocated  with 
either  the  actuators  or  displacement  sensors.  Next,  let  the  ordering  of  the 
modes  in  (1)  be  such  that  E is  invertible.  In  this  case,  it  is  possible  to 

3. 

define  the  similarity  transformation  U such  that 


T XT 
E E^S 
a a 

-S  I 


-1  T X -1  ^a  ° 

U ■^  = U^(UU^)  ^ ^ 

SE  I 0 
_ a _ . r 


where 


X X 

E (I  + S S)E  = 
a a 


B B 0 0 

0 CC^  0 
0 0 DD^ 


= (I  + SS  ) 


E E 
r a 


Here,  S is  termed  the  spillover  matrix. 

Defining  rj  = Ur|,  the  original  system  of  (1)  and  (2)  can.be  transformed  into 
a particularly  favorable  form. 


n + gin  + - din  = bu 


y = Cri  + Dn 
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Siq 


where 


aa  ar 
ra  rr 


(-Sfi,  + fi‘S)E  M 

X L 3.  d. 


T,  , 

? {-'D^ 

/n2  I 


+ Sfl2s^)M  ^ 
2 1 r 


r 1 

B B 

B = 

0 

a 

0 

C = [C  0] 
a 


"a  ■ '"CB  ^ “> 


D = [D  0] 

3 


with  and  given  by 


= diag[0)^ 0)^  ];  = diag[co^  _i_^, 


If  ri  is  partitioned  such  that  h = J>  where  : n^  x 1 and  n^.  : n x 1,  then 
the  aggregated  and  residual  models  associated  with  the  transformation  U are 


ri  + Bill  + p - din  = B u 
a a aa  a a a 


y = . C ri  + D n 
a a a a 


p + 3lp  + p - diri  = 0. 
r r rr  r r 


Note  that  spillover  has  been  transformed  from  control  and  observation  spill- 

ver,  as_in  modal  truncation,  into  the  present  form  of  dynamic  coupling  matrices, 

^ and  , between  the  aggregated  and  residual  models.  Any  other  aggregation 
IT  3.  31T 

procedures  will  generally  result  in  spillover  appearing  in  both  forms.  The  set 
of  states  included  in  the  aggregated  model  is  solely  a reflection  of  the  sensor 
and  actuator  influences.  A reasonable  design  of  sensors  and  actuators,  for 
instance,  should  effectively  sense  and  excite  only  certain  critical  modes.  As  a 
result,  the  aggregated  states  would  be  primarily  composed  of  the  critical  modes 
and  the  spillover  matrix  S-  would  be  numerically  small. 

Denote  the  eigenvalues  and  left  and  right  eigenvectors  of  and  as 
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where 


(18) 


(19a) 


(19b) 
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A 

ra 


0 0.  0 

0 0 
ra 


rr 


dl  - 

rr 


I 

-31 


(19c) 


(19d) 


Denote  the  ith  eigenvalues  and  left  and  right  eigenvectors  of  A as  C > ^ > 

aa  a . "^a . 

1 1 

^ . Note  that  A is  the  closed  loop  dynamic  matrix  of  compensator  (17) 

. 3. _t  33 


applied  directly  to  aggregated  model  (12)-(13)  and  A^^  is  the  untouched  residual 

model  dynamic  matrix.  The  compensator  thus  has  a direct  effect  on  the  aggre- 
gated model  and  an  indirect  effect  on  A through  the  coupling  matrices  A and 

TIC  3TC 

^ /S  /S 

A . The  effect  of  this  coupling  on  the  original  eigenvalues  of  A and  A can 

T3  33  ICV 

be  approximated  by  perturbation  techniques,  see  Appendix  A.  The  original  eigen- 

+ (2)  (2) 

values  of  K and  ^ , which  are  C and  p"  , are  shifted  by  Ac''  and  Ap~  \ 

aa  rr  ^a . r . ’ a . » 

11  11 

respectively,  where  the  superscript  2 denotes  that  the  shifts  are  second  order 

(2)  ±(2) 

in  the  elements  of  the  coupling  matrices.  The  terms  A^  and  Ap  are  given 

by  ^i  ^ 


r . 

1 


= -<a. 

1 1 


0 

0 


ar 


[dl  - 


rr 


I]  ^ 

a . 

1 


[0 


0]^l 


a . 

1 


(20) 


Ap; 


±(2)  _ 


r . 

1 


— i +B  K(p  )Q(p  ) Y 

, o ± \ ra"-  r.  aa  a r.  ^ r.  ar'r. 

(3+2p  ) i i 11  1 


(21) 


where 


K(p  ) = K - H(F  - p I) 
r,  r. 

1 1 

Q(p"  ) = C + p"  D . 
r.  a r.  a 
1 i 


(22) 

(23) 


c (2.^ 

The  terms  C;  =5  + A?  ^ 

3.  , 3 , 3 , 

ill 


±c  _ ±c  . A i(2) 
p = p ^ + Ap 
r . r . ^r  . 
11  1 


are  then  the  approximated  closed 


loop  poles  of  the  overall  system.  They  would  be  an  adequate  approximation  of  the 


actual  values  if  the  coupling  between  A and  A were  weak. 

aa  rr 


Such  a weak 


coupling  condition  can  be  achieved  by  designing  a reasonably  small  S and  by 

shifting  the  poles  of  A away  from  those  of  A by  means  of  feedback. 

“ aa  rr 
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Equations  (20)  and  (21)  do  not  conveniently  allow  a.  quick  estimation  of  the 
effect  of  spillover  because  they  contain  quantities  as  , Jl  , X , etc.  which 

i i 

require  appreciable  computation.  For  simplified  expressions,  one  can  retain  in 
(20)  and  (21)  only  terms  quadratic  in  S;  see  Appendix  B.  In  this  case  aggregated 
and  residual  pole  placement  is  given  by 


«a.  - 

1,  i 1 


± ± , 1 
P = P.  + 

r . X 
1 


0 

Q,  X.  £- 


[dI-(2^- ^[0  (-Sfl^Hfi^S)(E^)  ^ 0]tfl  (24) 

3- . J.  Z 3 3 . 

1 1 


V-  s E 
(3+2pT)  ’'i  ^ 


K(pJ")Q(ppE^ 

i 1 a 


I+(0)h-(5-^)  ^E 
i 1 a 


K(P*)Q(pJ)E^ 


-1 


(25) 


where  S denotes  the  ith  row  of  S and  the  superscript  indicates  that  (24) 
i 

and  (25)  retain  only  terms  quadratic  in  S.  Equations  (24)  and  (25)  contain  only 

readily  available  quantities  such  as  p^,  etc.  and  yield  quick  estimates  of 

the  effect  of  spillover.  Furthermore,  since  the  next  higher  order  are  terms 
proportional  to  S**,  they  can  come  quite  close  to  equations  (22)  and  (23)  in 
value  for  spillover  that  is  not  too  strong. 

COLLOCATED  SENSORS  AND  ACTUATORS 

T T 

In  the  case  of  collocated  sensors  and  actuators,  C = II  B and  D = II  B , 

« OS  D S 

E 


so  that  B 


E 


. Only  n independent  columns  are  used  in  forming  U.  Since 

_ T _ 

M 5)2  = M 

r rr  rr  r 


it  follows  from  the  symmetry  of  5)^  ^^d  5)2  that 


y = M £ 
r , r r . 
i 1 


(26) 


±(2) 

In  this  case,  (21)  can  be  solved  for  the  real  part  of  Ap  ^ , giving 

. ^i 

ReaKAp-''  0 = Real  ^ — F7  K(p”  )Q(p“  )1\ 

r.  ± ^ 1 r.  r_.  1 


where 


(6+2p-  ) 
1 


(27) 


rr  = B"'B[X2  m - e'^(5)2+ s'^5)2S)E  + b'^BK(p“  )Q(p“  )M  ] ^E  (-5)2s'^  + S"5)2)£ 

"""  2a  “ — 


, + 
' i 


r . .a  a 1 
1 


+ 
r , 


^i  ^ 


\ nf 
2^  r. 


(28) 
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and  a superscript  * denotes  complex  conjugation. 

i 

It  is  a concern  in  designing  F,  G,  H,  K that  p not  be  destabilized.  Sup- 

' i 

pose  that  F has  distinct  roots  and  that  T is  its  transformation  to  Jordan  form. 


(29) 


where  H = diag[^^, • • • ].  Then  sufficient  conditions  for  the  shift  (27)  to 
be  non-positive  can  be  derived  for  several  feedback  forms. 

For  displacement  measurement. 


(1)  KII  be  symmetric 

CB 

(2)  (HT)*  = (T.“^Gn  ) 

Lii) 

(3)  -||(3+2p7  )|  < Im(Cj^)  < ||(B  + 2pJ  ) 

i i 


i = n + 1,  • • ■ 
u 

k = 1, •••,  n, 


n 


(30) 


For  velocity  measurement, 

(1)  KlI  be  S3mimetric  and  negative  semi-definite,  and  either 
DB 

(2a)  F,  G,  H = 0 
or 

(2b)  (HT)>v  = -(T  GHj^g) 

Re(?k)  - ~ ¥ 


k = 1, 


-|t(3+2p:  )|  < Im(Cj^)  < 1^(6  + 2p:  ) 


^i 


i = n +1, 
u 

k = !,•••, 


n 


(31) 


For  displacement  and  velocity  measurement. 


(1)  n 


CB 


an. 


DB 


where  a is  a scalar 


(2)  Kn„.n  be  symmetric  and  negative  semi-definite,  and  either 
DB 


(3a)  F,  G,  H 


0 


3 


or,  if  a - - < 0, 
(3b)  (HT)*  = -(T'^GHj^g) 

Re(?k^  - 


k = 1, 


n 


-||(3+2pJ  )|  < Im(^^)  < ||(3  + 2P^  ) 

i 1 


1 

k = 1, 


n 4*  1 , 
u 


n 


(32) 


It  is  Interesting  to  observe  that  for  displacement  measurement,  nonzero  F,  G,  H, 
e.g.  a velocity  observer,  are  needed  to  introduce  additional  damping  into  the 
residual  states.  With  velocity  measurement,  no  additional  dynamics  are  required 
Because  of  (16) , the  condition 
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|i(B+2p;  )|  < Im(r  ) < |i(3  + 2p;  )| 


i = n 1, 
u 

k = l,-“, 


(33) 

— 2 (2) 

requires  only  the  knowledge  of  the  smallest  eigenvalues  of  . The  shift  A? 

irxr  sl  ^ 

usually  generates  less  concern  since  the  designed  ? should  have  a large  margin 
of  stability.  ^i 

EXAMPLE 

For  the  case  of  collocated  sensors  and  actuators  with  the  displacement 
T 

measurement  y = B , II  = I,  choose  F,  G,  H,  K of  the  following  form: 

CB 


n = n 
u a 


^ . >0  for  i=l , . . . ,n^ 


G = 


H = (R^)  ^ 

a 


T -1 
K = -(R^)  ^ 
a 


R 


-1 


R 


(34) 


where  R = columns [y  *•* 
a a^ 


Y ] . Equation  (34)  satisfies  the  sufficient  condi- 

di 

n 

u 


tions  (30)  for  displacement  measurements.  Substituting  (34)  into  (19a),  the 
eigenvalues  of  the 

/s 

of  the  matrices  a 


eigenvalues  of  the  resulting  A can  be  seen  to  be  the  union  of  the  eigenvalues 

aa 


1 = 1 , • • • , n , where 
aa^  u 


aa . 
1 


■El 


®i 

0 

d - - k, 

a . 1 

1 


0 

1 


(35) 


see  Appendix  C. 
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The  compensator  of  the  form  (34)  affects  different  eigenmodes  of  the  aggre- 
gated model  Independently  so  that  g.,  and  k.  can  be  separately  designed  for 

i ^ t ^ 

each  i.  Denote  the  three  poles  of  «r  by  p , t = 0,+,-,  where  0 represents 

H3  . 3.  . 

1 i 

the  real  eigenvalue  and  + and  - represent  the  complex  conjugate  pair.  The  neces- 
sary and  sufficient  conditions  for  to  be  stable  are 

i 


g?  + e,(d  - < 0 

1 1 


(36) 


which  necessarily  requires  (d  - - k.)  < 0.  With  k,  so  chosen,  the  following 

scaled  quantities  can  be  defined.  1 


a . 

1 


d - - k. 

' a . 1 

1 


1/2 


^1  = 


d - - k. 

a . 1 

1 


|l/2 


®i  = 


§1 


A^  - k, 
a . i 
1 


3/4  • 


(37) 


Figure  1 shows  the  eigenvalues  p®  and  p"*"  for  different  values  of  | and  g . 

a , a . IX 

1 1 

Only  the  case  for  3 = 0 is  shown  since  weak  inherent  damping  typical  of  LSS  sys- 
tems would  be  swamped  by  the  feedback. 

The  important  result  from  the  figure  Is  that  the  eigenvalues  of  a , given 

t 1/2'^t  " ^ 

by  p = Id  - A^  ~ k. I p , hence  those  of  A , are  indeed  on  the  left  half 
a.  ' a.  i'  a.  aa 

111 

plane  and  can  in  principle  be  pushed  arbitrarily  to  the  left  by  choosing  an 
arbitrary  large  k..  Such  a compensator  given  by  (34)  would  therefore  stabilize 
the  aggregated  moSel,  preserve  the  validity  of  the  perturbation  technique  and 
guarantee  the  non-destabilization  of  the  residual  states. 


As  a numerical  example,  consider  a vibrating  string  with  fixed  ends  in  the 

region  0 < r < 1,  with  two  collocated  sensor  and  actuator  pairs.  For  simplicity, 

the  system  model  consists  of  3 modes,  includes  only  displacement  measurement 

(n^„  = I,  n = 0),  and  exhibits  a destabilizing  mechanism.  Thus, 

CB  DB 


n = - 


4ir^ 


0 9F 
b 


*1 

b 

b 1 

11 

12 

n + dp  + 

b2i 

CM 

CM 

) 

b 

J 

L 31 

32J 

11  ^21  ^31 


^11  ^22  ^^32 


(38) 


where  b . . = /2 
ij 

sensor  actuator  pair,  and  /2  sin  iTTr  is  the  eigenfunction  of  the  ith  mode. 


f.(r)sln  iirrdr,  f . (r)  is  the  Influence  function  of  the  jth 
, 1 J 
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Defining  the  new  control  u = 


^ll-' 

.^12  ^22, 


^11  ^12 
^21  ^22 


y , (38)  becomes 
o 


7T^  0 0 


u and  the  new  measurement  y - 
0 


1 0 


n = - 0 4tt,^  On  + dn+0  lu 

0 0 9'(T.^ 


1 0 s 


0 1 s 


where  (Sj^S2)  = [l>3i  ^>32] 

determined  to  be 


^11  ^12 
b2i  b22 


. The  aggregation  transformation  U is 


1 0 S2_' 


U = 0 1 

.“Si  -S2  1 

and  the  transformed  system  is  given  by 


where 


TT^ 

(1+  sj  + 

S|) 

1 + 

S-,  s. 

1 

1 2 

S1S2 

1 + S 

Ti  = - + 

dn  + Bu 

y = C n 

1 + 9s5  + 

8S^ 

83, 

1 2 

1 2 

1 

5S1S2 

4+  4S^+  9S2 

8Sj^  + 3S1S2 

5S2  - 33^82 

9+3^4-43 

L 0 0 J 

_ 'l  0 o' 

c = 

0 1 0 

For  this  example,  d = 2tt^  so  that  the  lowest  mode  in  the  system  is  destabilized. 
Choose  the  compensator  F,  G,  H,  and  K as  in  (34).  Let  = kir^  be  the  only 

undetermined  compensator  parameter  and 


Cl  = ?2  = 


g^  = §2  = 1.0412. 
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Then  from  Fig.  1,  with  € = 


p“  = p“  = - 0.60954 
ai  a.2 


p“  = p“  = - 0.39523  ± 0.60164/  . 

ai  a2 


The  eigenvalues  of  the  closed  loop  aggregated  matrix  A are  thus,  t=0,+,- 

3.3. 


p = P - kjr^ 

ai  ai ' ai 


and  p^  = p^  |2tt^  - 


&2  ^2 


and  the  eigenvalues  of  the  residual , model 


= ±<  IT 


9+  s?+  s! 


1+ 


Due  to  the  nonzero  coupling  matrices  and  A , these  eigenvalues  will  be 


shifted.  Figures  2 through  5 show  the  designed  poles  p 


corrected  poles  p'"*',  p"^^  and  p^^,  p^'^  for  different  values  of  k for  the  cases  of 

ai  r3  r3 

weak  spillover,  ^3  = 0.2,  and  strong  spillover,  8^  = 82  = 0.8.  For  compari- 

son, the  exact  value  of  the  corresponding  poles  are  also  included.  On  comparison, 
several  points  are  noted.  First,  as  k is  increased,  the  validity  of  the  pertur- 
bation result  is  enhanced  due  to  the  growing  distance  between  the  poles  of  A 

33 

and  A , this  is  most  apparent  in  Figs.  4 and  5.  8econd,  as  expected,  the 
validiiy  of  the  perturbation  technique  increases  with  decreasing  spillover. 
However,  the  damping  introduced  to  the  residual  states  increases  as  8^  and  82* 

Third,  for  very  large  k,  the  beneficial  spillover  damping  in  the  residual  states 
is  reduced.  This  is  again  a result  of  the  increasing  distance  between  the  poles 


the  designed  poles  p , p^  , the  perturbation 
+8  ^ 13 

p for  different  values  of  k for  the  cases  of 
^^3 


of  A and  A -.  Fourth,  given  their  ease  of  evaluation,  the  p"”^  and  p ex- 
aa  rr  ’ '^a^  '^r3 

pressions  yield  quite  reasonable  estimates  of  the  exact  values,  especially  for 

the  weak  spillover  case.  Finally,  and  most  importantly,  it  is  indeed  possible, 

as  illustrated  by  this  example,  to  have  a reduced  order  compensator  designed 

based  on  the  aggregated  model  and  still  guarantee  the  stability  of  the  overall 

system. 

CONCLU8ION8 

Spillover  cannot  be  eliminated  by  aggregation  procedures.  The  present 
method  merely  transforms  control  and  observation  spillover  into  the  form  of  dy- 
namic coupling  matrices.  The  effect  of  spillover  in  this  form  can  then  be  more 
readily  approximated  using  perturbation  techniques.  Two  sets  of  perturbation- 
corrected  pole  shifts  were  derived.  The  first  set  gives  very  close  approximation 
to  reality  if  the  validity  of  the  perturbation  technique  is  preserved.  Further, 
it  yields  sufficient  conditions  that  guarantee  the  non-destabilization  of  the 
residual  states.  The  second  set  of  pole  shifts,  obtained  by  truncating  the  first 
set  to  retain  only  quadratic  terms  in  the  spillover  matrix  8,  enables  quick 
estimations  of  the  pole  shifts. 


tS^  , ±S^ 


As  illustrated  by  a numerical  example  for  the  collocated  sensor  and  actua- 
tor case  with  displacement  measurement,  a reduced  order  compensator  which  yields 
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stability  for  the  overall  system  can  be  obtained  adopting  the  present  approach. 
Damping  is  indeed  introduced  to  the  residual  states  through  spillover.  The 
amount  of  damping  introduced  is  proportional  to  spillover  squared.  Thus  proper 
actuator  and  sensor  design  should  result  in  spillover  values  such  that  rela- 
tively large  damping  can  be  introduced  while  the  validity  of  the  perturbation 
technique  at  the  designed  control  gains  is  still  maintained.  Spillover  need  not 
always  be  minimized. 

The  present  method  yields  an  aggregated  model  order  related  to  the  number 
of  sensors  and  actuators.  Further  control  of  aggregation  model  order  can  be  ob- 
tained using  combined  (synthetic)  sensors  and  actuators  in  the  design  procedure. 
This  topic  is  presently  under  study. 

The  authors  wish  to  thank  the  Lockheed  Missiles  and  Space  Company  and  the 
U.S.  Joint  Services  Electronics  Program  (Contract  DAAG-29-78-C-0020)  for  their 
support  during  the  course  of  this  research. 
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APPENDIX  A:  EIGENVALUE  PERTURBATION  DUE  TO  COUPLING  MATRICES 


Consider  the  matrix  A 


A A 

aa  ar 

/S  /N 

A A 

ra  rr 


Denote  the  ith  eigenvalue,  left  and 


-f  + 


right  eigenvector  of  A^^  and  A^^  by  ^a^’  Pr^_’  ^r^’  ’^rj_’ 

/S  "i" 

tively.  If  A and  A were  zero,  the  ^ and  p“  would  be  the  eigenvalues  of 

31T  1l  d. 

/S  A 

However,  due  to  the  coupling  between  and  Aj-j-  through  the  partitions  A^^j.  and 

^ra>  ?a  • Prj^  will  be  shifted.  According  to  perturbation  theory,  the  shifts, 

(2)  ±(2) 

denoted  by  and  Ap^  , are  given  to  second  order  in  the  elements  of  the 

U-i  J-  ^ 

coupling  matrices  by 

A?^^^  = - X A [A  - C I]“^A  ^ (42) 

a a ar  rr  a , ra  a , 

1 1 i 1 
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>> 


Ap 


- - “ Prj[^^]  ^ar^  i 


r± 


(43) 


^ /\  /\ 

With  A as  given  in  (19),  i~  , y”  are  related  to  £ , y as  follows: 


r . 

X 


'r . 
1 


(3+pj-  ) 

^ £ 1 ^ 

(3+2p-^)  ^ (3+2p“^)  ^ 


r . 

1 


^ + 


'r . 
1 


Priori 


(44) 


Equations  (20)  and  (21)  are  then  obtained  by  substituting  (19)  into  (42)  and  (19) 
and  (44)  into  (43),  respectively,  and  applying  the  matrix  identity 


11 

M 

12 

-1 

M ^ + M (M  - M M M ^ 

11  ir  12^  22  21  11  12^^  21  11 

-M  (M  - M M ) ^ 

1112'-  22  21  11  12-^ 

M 

_ 21 

M 

22. 

-Cm  -m  m'"^m 

L ^22  21  11  12^  21  11 

(M  -M  )''^ 

^22  21  11  12^  J 

APPENDIX  B:  EXPRESSIONS  OF  SECOND  ORDER  ACCURACY  IN  S 

To  reduce  ^quation^  (20)  and  (21)  to  second  order  accuracy  in  S,  notice 
from  (11)  that  and  each  have  lowest  order  terms  first  order  in  S. 

Their  products  are  second  order  in  S,  thus  the  quantities  L , Yr . > > etc.  in 

I i I i I i 

(20)  and  (21)  can  be  replaced  by  their  zero  order  values  in  S.  As  a result, 
substitutions  in  (20)  and  (21)  can  be  made  as  follows: 

M E^E  ; M ->  I;  e'^'^ (e'^)~^;  ■> 

a a a’  r ’ aa  a 1 a^  ’ rr  2 ar  a 1 2'’  ra 


(-SS^J+f2^S)(E^)  ->  [0,  •••  ,0,1,0,  •••  ,0] ; ->  [0,  •••  ,0,1,0,  •••  ,0]^;  p* 

i i , ’’i 


The  new  equations  yield 


(2)S' 


and  Ap' 


±(2)S' 


, which  denote  equations  (20)  and 


(21)  reduced  to  quadratic  terms  in  S,  respectively.  The  relation 
('2') 

4-  A^^  is  then  (24)  while  the  relation 
^i  ^i 


±S^ 

D 

r . 

1 


± , A ±(2)S 

+ Ap  ^ ^ 


,2 


r . 

1 


= P 


1 

1 


r . 

1 


(3+2pp 


S (fi^-u)?I) 
r . 1 1 

1 


+ E 

i 1 a 


.-1 


K(ppQ(ppE^ 


(fi,-totI)S 
1 1 r^ 


(45) 


with  Sr^  denoting  the  ith  row  of  S,  can  be  further  manipulated  to  become  (25). 
For  F,  G,  H,  K = 0, 


= p~  + — r- s (J2,^  - o)?i)s^ 
’^i  ’^i  (3+2pp  ’^i  ^ 


(46) 


It  is  known  that  the  exact  eigenvalues  of  the  system  without  feedback  are  p^. 
+ 

Thus  (46)  is  p^  expressed  to  second  order  in  S,  and  the  approximation 
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(47) 


S - <I)S  - p“ 

(B+2pp  ’^l  ^ ^ 

Introduces  an  error  by  terms  which  are  fourth  order  in  S.  Since  the  expressions 
of  concern  are  to  second  order  in  S,  this  error  can  be  neglected.  Putting  (47) 
into  (45)  and  making  use  of  the  matrix  identity 

(M^  + M^)"^  = m“^  - M"^  M^d  + 

results  in  (25)  . In  this  manner  , (47)  acts  as  an  "initial  condition"  in  that  it 

±S^ 

provides  equality  between  p and  the  exact  values  in  the  absence  of  feedback. 

^i 

APPENDIX  C;  EIGENVALUES  OF  A WITH  F,  G,  H,  K AS  GIVEN  IN  (34), 

aa 

Two  relations  between  73,  and  should  be  noted.  First,  as  in  (26), 

T • XX 

Y„  . = M , Second,  the  orthonormalization  condition  Yo  = ^5 , . is  adopted, 
a • a ' a ij  ^ 

p 1 J 

With  M = B B for  the  collocated  sensor  and  actuator  case,  R = 

o 'a 

^ T T T ^ 

column [y  ••*,Y  ] and  L = column [£  *•*,£  ],  it  can  be  shown  that 

a,  a a a^  a 

In  In 

u u 

T T 

R = B BL 
a a 


Substitute  (34)  into  (19) , apply  a similarity  transform  to  the  resultant  A with 

3.3. 


0 0 


V = 0 L 

a 

0 0 


and  V 


10  0 

0 R 0 

a 

0 0 R, 


and  make  use  of  (48) . The  result  is  that 


VA  V 
aa 


d-X^  -k, 
a^  1 
X • . 


d-X'^  -k 
a n 
n u 
u 


By  rearranging  the  columns  and  rows  of  (48) , it  can  be  seen  that  the  eigenvalues 

of  VA  hence  those  of  A , are  given  by  the  union  of  the  eigenvalues  of 

33  ^ 33 

/\  /N 

«aa^’  ^ l5**‘jn^,  with  given  by  (35). 
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FIGURES 


Fig.  1.  p°  locus ( real)  and  p"*^  locus ( complex)  at  different 
^i  ^ ^ 

values  of  and  for  the  case  3 = 0.0. 
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Fig.  2.  Comparison  between  the  designed,  exact  and  approximated  closed-loop  eigenvalues 
of  the  first  aggregated  state  in  the  examined  numerical  example,  with  $2=  0 
and  k,  the  sole  undetermined  compensator  parameter,  takes  on  different  values. 
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Fig.  3.  Comparison  between  the  designed,  exact  and  approximated  closed-loop  eigenvalues 
of  the  residual  state  in  the  examined  numerical  example,  with  Si=  82=  0.2  and 
k,  the  sole  undetermined  compensator  parameter,  takes  on  different  values. 


REAL  PART  OF  EIGENVALUE<  UNIT*.  TT  > 

Fig.  4.  Comparison  between  the  designed,  exact  and  approximated  closed-loop  eigenvalues 
of  the  first  aggregated  state *in  the  examined  numerical  example,  with  S[=  82=  0 
and  k,  the  sole  undetermined  compensator  parameter,  takes  on  different  values. 


Exact  values 
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Fig.  5.  Comparison  between  the  designed,  exact  and  approximated  closed-loop  eigenvalues 
of  the  residual  state  in  the  examined  numerical  example,  with  Si=  S2-  0.8  and 
k,  the  sole  undetermined  compensator  parameter,  takes  on  different  values. 
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A DESIGN  PROCEDURE  FOR  ACTIVE  CONTROL  OF 

BEAM  VIBRATIONS 

Stephen  L.  Dickerson  and  George  Jarocki 

Mechanical  Engineering,  Georgia  Institute  of  Technology 
Atlanta,  GA  30332 


ABSTRACT 

This  work  analyzes  the  transverse  vibrations  of  beams  and  presents  a 
methodology  for  the  design  of  an  active  damping  device. 

The  Bernoulli-Euler  equation  is  used  to  derive  a transcendental  transfer 
function,  which  relates  a torque  applied  at  one  end  of  the  beam  to  the  rota- 
tional position  and  velocity  at  that  point.  The  active  damping  device  consists 
of  a wire,  a linear  actuator  and  a short  torque  arm  attached  to  one  end  of  the 
beam.  The  action  of  the  actuator  varies  a tension  in  the  wire  and  creates  a 
torque  which  opposes  the  rotation  of  the  beam  and  thus  damps  vibration. 

A contribution  of  the  paper  is  a design  procedure  for  such  an  active 
damper.  This  procedure  shows  the  relationships  and  trade-offs  between  the 
actuator  stroke,  power  required ,■ stress  levels  in  the  wire  and  beam  and  the 
geometry  of  the  beam  and  wire. 

It  is  shown  that  by  consideration  of  the  frequency  response  at  the  beam 
natural  frequencies,  the  aforementioned  relationships  can  be  greatly  simplified. 
Similarly,  a simple  way  of  estimating  the  effective  damping  ratios  and  eigen- 
value locations  of  actively  controlled  beams  is  presented. 


SYMBOLS 

A cross  sectional  area  of  a beam  (or  wire  if  sub  w) 
b feedback  gain 

d length  of  moment  arm 

E modulus  of  elasticity 

f tension  in  wire 

G open  loop  transfer  function 
H closed  loop  transfer  function 

J moment  of  cross  section  of  beam 
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L length  of  beam  (or  wire  if  sub  w) 

p open  loop  poles 

P power 

s d/dt  operator 

X beam  length  coordinate 

y beam  displacement 

z extension  of  wire 

open  loop  zeros 

6 normalizing  factor  for  time 

p normalized  length  coordinate 

0 rotational  angle 

closed  loop  damping  ratio  of  n^^  pole  pair 
a negative  of  real  part  of  eigenvalue 

T normalized  time 

to  frequency  of  oscillation 


MODEL  OF  BEAM  WITH  RATE  FEEDBACK 


A beam  of  constant  cross-section  and  material  properties  with  no  internal 
damping  and  shear  deformation  may  be  described  by  the  Bernoulli-Euler  equation. 


EJ  + Ap 
9x^ 


0 


(1.1) 


If  the  beam  is  of  length  L it  is  convenient  to  normalize  this  equation  using 


n = L ^ 


and  T = 6t 


(1.2) 


where  S = obtain  the  partial  differential  equation 


„4  2 

9n  6t 
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0 


(1.3) 


For  the  case  of  a beam  pinned  at  both  ends  the  boundary  conditions  y=0  and 
§2y/5^-2  = o at  n=0  and  n = 1 yields  the  modal  solutions 


y(t|>T)  = C sin  (o)  T+d))  slni4j~  n (1.4) 

n n n 

2 2 

where  o)  = Trn  , n=l,2,3,... 
n 

Such  a beam  if  excited  oscillates  with  frequency  components  These 

oscillations  can  be  damped  by  applying  a moment  at  either  or  both  ends  that 
opposes  the  rotational  rate.  That  is,  the  boundary  condition  at  p = 0 becomes 

M(0,t)  = -b  — 0(0, t)  (1.5) 

0,2 

where  0 = 6y/6p  and  M = -6  y/6p  . The  sign  convention  on  M is  necessary  to 
make  a positive  moment  result  in  a positive  0 and  y, 

A moment  opposed  to  rotational  velocity  will  damp  oscillations  because 
it  removes  energy.  Points  other  than  p = 0 are  acceptable  but  examination  of 
Eq.  1.4  shows  that  there  are  values  of  p on  the  modal  solution  where 
6^y/<5x(St  (i.e.,  d0/dt)  is  zero.  Such  modes  will  not  be  damped  if  the  moment 
is  applied  at  these  points. 

To  examine  the  effect  of  variation  of  the  damping  constant  b in  Eq.  1.5 
it  is  possible  to  consider  b as  a gain  in  a feedback  control  system  as  shown 
in  Fig.  1.  The  transfer  function  G(s)  is  derived  relating  the  input  moment 
M to  the  rotational  velocity  at  p = 0.  The  result  is 


G(s) 


/2s(  sin/^-  sinhv^¥) 

2 (cos  - cos  h/^) 


(1.6) 


where  s can  be  interpreted  as  the  derivative  operator,  d/dt.  The  "poles"  of 
the  transfer  function  are  the  zeros  of  the  denominator  or 


, 2 2 

Pn  = l^r  n , 


n = 1,2,3... 


(1.7) 


There  is  also  a pole  at  infinity  since  the  numerator  is  of  order  greater  than 
the  denominator.  The  "zeros"  are  given  by  the  zeros  of  the  numerator  or 


0 

j7T^(n+  0.25)^, 


n = 1,2,3. 


(1.8) 
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Actually  the  expressions  (n>0)  are  slight  approximations.  z]^=15.418  instead 
of  15.421  as  given  by  Eq.  1.8.  The  approximation  for  n>l  is  much  better.  From 
the  zero  and  pole  locations  one  can  write 


= „Bo  ' 1 + t: 


G (s)  - 


TT  (n+0.25) 


4 4 
n 7T 


(1.9) 


where  the  1/3  comes  from  Eq.  1.6  using  the  limit  as  s approaches  zero  of 
G(s)/s=l/3.  With  feedback  as  shown  in  Fig.  1 or  as  suggested  by  Eq.  1.5  the 
overall  transfer  function  is 


H(s) 


G(s) 

l+bG(s) 


(1.10) 


or 


I 3,  (1+ 7 

ir^ (n+0.25)^ 


H(s)  = 


N , S , 2? 


n , ( 1 H 7)  + -7  b fl , ( 1 + 

n=l  44  3 n=l 

n ir 


n2 (n+0.25)^ 


which  can  be  written  as 


s n,  (1+ 
n=l 


H(s)  = 


TT^  (n+0.25) 


3(1+Ts)  .n,  (1+  ^s  + (-^)^) 
1=1  0) 


cn 


cn 


(1.11) 


(1.12) 


where  the  closed  , loop  natural  frequencies  and  damping  ratios  are  evident. 
The  form  of  the  denominator  is  arrived  at  by  consideration  of  the  root  locus 
discussed  in  subsequent  sections. 


VARIATION  OF  SYSTEM  EIGENVALUES  WITH  FEEDBACK  GAIN 

The  characteristic  equation  of  H(s)  is  l+bG(s)=0.  A numerical  solution 
for  the  root  locus  for  the  first  few  branches  is  presented  in  Fig.  2.  This 
numerical  solution  uses  the  transcendental  form  of  G(s) , Eq . 1.6,  and  finds 
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valueis  of  s for  which  the  imaginary  part  of  G(s)  is  zero  and  then  calculates 
b from  "minus  inverse  b = real  part  of  G(s)." 

For  light  damping  (small  b)  a good  approximation  for  the  closed  loop 
damping  ratios,  5n>  found  by  (1)  assuming  = realizing  that 

H(jti)jj)  = 1/b  and  (3)  neglecting  the  complex  portion  of  most  of  the  denominator 
terms  as  follows. 


2 2 1 
H(m  TT  j)  = — = 


2 2 
m TT 


2 2 

3 1 1+Tm  ir  j 


(2.1) 


Now  if  the  imaginary  terms  in  the  d.enominator  above  are  neglected  except  for 
n = m then 


j. 

b 


2 2 


m TT 


-^m 


where 


^n  + 0.25  ^ 


n,  1 

n=l ' 
n^m 


<E)-| 


Thus  5 

m 


2 2, 
m IT  ba 


m 


(2.2) 


From  numerical  calculation  this  reduces  to 


?m~^  ’ ™=1>2,3,... 


(2.3) 


From  Fig.  2 it  is  apparent  that  this  approximation  is  good  for  small  b. 

"Small" b appears  to  depend*  on  the  m value.  From  numerical  results  the 
maximum  possible  damping  ratio  for  the  m^^  eigenvalue  is  apparently  given 
approximately  by 


0.28 

n 


(2.4) 


From  Fig.  2 and  Table  1 it  is  apparent  that  "low"  values  of  b damp 
higher  frequency  oscillations  better  while  "high"  values  of  b (but  not  greater 
than  0.3)  damp  the  low  frequency  oscillations  which  are  more  important  to  damp 
actively. 

An  approach  similar  to  the  above  can  be  used  to  estimate  the  closed  loop 
eigenvalues  for  "large"  b.  From  Eq.  1.11  the  characteristic  equation  is 
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n\  ^ TT  (n  + 0.25) 


) = 0 


(2.5) 


2 2 

Let  a solution  be  s = jir  (m+0.25)  - where  is  "small"  and  positive, 

substituting  into  Eq.  2.5  the  magnitude  part  becomes 

I ^ f-  / CJ  2 2(J  n 

T ff  I 1 /m  + 0.25^4  , m . m , , ^ .,rs2| 

3nHiU-(  --n > (”^+0-25)  | 

n ir  n IT 


+ b |j  ir^  (m  + 0.25)  ^ 


i??i,  /m  + 0.25^4,  m 

o n,  ) + ~7 7 

m'n=l'  n + 0.25  4,  , „ „r\4 

IT  (n+  0.25) 


(2.6) 


_ j2a  , I = 0 

“ ir^(n+0.25)^ 


If  the  imaginary  parts  (except  for  m=n  in  the  second  product)  and  the 
terms  are  neglected,  then 


1 |l-(!l±»^)''| 

.3^  n=l'  n ' N 

<^m  “ b ^ 2 ^ I 1 / m + 0.25 . 4 ^ 

n=ll^  '-n  + 0.25^ 


(2.7) 


which  by  numerical  calculation  reduces  to 


•^m  ~ b 


(2.8) 


From  Fig.  2 and  Table  2 it  is  apparent  that  this  approximation  is  good  for 
"large"  b and  is  better  for  large  m for  a given  value  of  b. 

Incidentally,  the  time  constant  associated  with  the  real  closed  loop 
eigenvalue  is  approximated  by  T = b/3  for  "large"  b.  For  "small"  b the  time 
constant  approaches  zero  and  in  most  cases  is  thus  unimportant. 


To  summarize,  the  eigenvalues  associated  with  mode  m (m=  1,2,3,...)  have 
damping  ratios  approximated  by  Eq.  2.3  for  "small"  b.  Eq.  2.4  and  Table  1 
provide  a dividing  line  between  "large"  and  "small"  b which  depends  on  m. 
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Eq.  2.8  gives  the  approximate  real  part  of  the  eigenvalue  (negative)  for  "large 
b.  A graphical  interpretation  of  eigenvalue  placement  is  given  by  Figure  3. 


ACTIVE  DAMPER  CHARACTERISTICS 

An  active  damper  construction  is  illustrated  by  Fig.  4.  A linear 
actuator  varies  the  tension  in  a wire  stretched  between  the  beam  ends.  The 
tension  force  f(t)  developed  by  the  linear  actuator  acting  on  the  torque  arm 
of  length  d creates  a moment  which  opposes  the  angular  velocity  at  the  beam 
end  and  thus  damps  vibrations  by  removing  energy  from  the  beam.  The  resulting 
control  loop  at  the  beam  support  where  the  torque  is  applied  is  depicted  in 
Fig . 1 . 

The  moment  equation,  rw  longer  normalized , is 


M = -b0  = -fd 


(3.1) 


Note  that  b (normalized)  must  be  multiplied  by  EJ/L6  to  get  the  actual  feedback 
gain  b.  If  z is  defined  as  the  extension  of  the  actuator  then  z has  two  com- 
ponents— that  due  to  rotation  of  the  moment  arm  and  that  due  to  elongation 
of  the  wire.  Thus 


z = d0  - 


fL, 


(3.2) 


Using  Eq.  3.1  we.  have 


z = (d-Bbs)0  where 


(3.3) 


and  as  before  s stands  for  d/dt. 

For  simplicity  any  initial  tension  in  the  wire  has  been  excluded  from 
this  analysis  but  is  necessary  in  a practical  application.  The  reader  should 
imagine  a constant  tension  device  in  parallel  with  the  dynamic  actuator  to 
provide  this  tension.  Naturally,  this  affects  the  force  variations  allowed 
in  the  wire  to  prevent  breakage  or  slackness. 

The  instantaneous  power  is  given  by  -(sz)(f)  or 


P = -[s(d-Bbs)0]  [^0]  (3.4) 

d 

Suppose  for  now  that  the  moment  provided  by  the  actuator  is  a known  amount  at 
a frequency  w. 
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M = Mq  s in  wt 


(3.5) 


Then 


Mq  Mq  2 -MoO) 

0 = S0  = - -r~  sin  ut , 0 = t — cos  tot , and  s 0 = — cos  o)t 

b DO)  ’ b 


Substitution  into  Eqs.  3.1,  3.3  and  3.4  gives 


M 

o 

f = — = -f 
MAX  d MIN 


MAX  ° bo)  MIN 


MAX  2b 


(3.6) 


-M, 


3 _ __2__  ^ 

MIN  2b  ^ d ^ ^ 


AVE  2b 


Each  of  f,  z and  P are  sinusoidal  at  frequency  o)  except  P is  at  frequency  2o). 


To  apply  the  above  relationships  one  needs  to  interpret  the  significance 


of  f 


MAX’  MAX’  MAX’ 


etc . 


^MAX  greatest  force  which  the  actuator  must  deliver.  It  plus  any 

initial  tension  in  the  wire  is  the  greatest  tension  in  the  wire.  An  initial 

tension  would  usually  exceed  f,,.„  in  order  that  the  wire  never  go  slack. 

MAX 


Two  times  is  the  stroke  required  of  the  actuator. 

MAX 

PmaX  the  greatest  instantaneous  power  required  from  the  actuator.  It 
is  zero  if  the  wire  is  inelastic  or  B = 0.  The  average  work  done  by  the  actuator 
over  a cycle  is  always  negative  since  the  actuator  is  being  used  to  remove 
energy  from  the  beam.  Pmin  always  negative  and  represents  the  maximum  energy 
absorption  rate  of  the  actuator. 

Mq  sin  (wt)  could  be  interpreted  as  a moment  applied  at  x=0  if  co=a)^. 

Under  these  conditions  the  actuator  must,  in  the  steady  state,  absorb  exactly 
the  moment  applied  by  an  external  excitation. 

Alternatively  for  a lightly  damped  case  where  the  motion  associated  with 
any  mode  is  approximately  sinusoidal  M^  can  be  related  to  an  initial  displace- 
ment or  velocity  of  the  beam.  For  instance,  referring  to  Eq.  1.4  if  the 
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maximum  midpoint  displacement  of  a mode  is  1 , then 


0 


sin  0)  T 
n 


S0 


MAX 


3^y 

8n3T 


,3/2 


3 3 
n ir 


from  which  M ==  bn  tt  , b normalized. 


NUMERICAL  EXAMPLE 

Consider  the  American  Standard  S6xl7.25  I-beam  for  which 

E = 29  X 10^  psi  J = 26.3  In'^ 

p = 7.36  X 10  ^ lb  sec^/in^  A = 5.07  in 

Suppose  the  active  damper  applies  a maximum  moment  of  20,000  in. lb.  and  that 
the  length  of  the  torque  arm  is  6 in. , in  this  case  equal  to  the  depth  of  the 
beam,  and  that  the  wire  is  0.375  inches  in  diameter  of  carbon  steel  with 
E = 29  X 10  psi.  This  gives  a maximum  stress  of  30,193  psi  which  must  be 
approximately  doubled  to  prevent  slack. 

If  the  initial  displacement  in  the  beam,  which  needs  to  be  damped,  is 
taken  as  that  which  results  from  a 50,000  psi  max  stress  in  the  beam  from  the 
first  mode  shape,  then  the  mode  shape  is  given  by 


y(x,0)  = C sin  ( ) 
where  C = 20.96  inches. 

Since  0„,„  = 0.1098  rad  and  6 = 1.256/sec,  o)-.  = = 12.39  rad/sec, 

MAX  J- 

from  x^hlch  “ 1.361  rad/ sec.  The  maximum  b allowed  is  1.47  x 10  in.  lb. sec. 

from  the  relationship  1.361  b = 20,000.  In  normalized  terms 

b = b ^ = 0.0145 

This  can  be  interpreted  as  adding  a damping  ratio  of  1.45%  to  the  low  frequency 
modes  and  moving  the  "high"  frequency  modes  to 

-a  = = -138  (normalized)  = -173/sec. 

b 
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The  "high"  frequency  modes  in  this  case  begin  at  about  the  20^^  mode  and  are 
probably  unimportant.  However,  the  same  wire  and  actuator  could  be  used  to 
provide  much  greater  low  frequency  damping  ratios  if  the  amplitude  of  oscilla- 
tion were  greatly  less  than  21  inches. 


SOME  PRACTICAL  CONSIDERATIONS 

Unfortunately,  no  real  control  system — measurement,  computation,  actuation 
— can  realize  a simple  gain  b at  all  frequencies.  Fortunately,  no  real  beam 
can  realize  zero  damping  at  any  frequency  and  as  a rule  that  damping  expressed 
as  a damping  ratio  increases  for  higher  modes.  From  root-locus  arguments  it  is 
possible  to  conclude  that  as  b realizes  a phase  lag  at  higher  frequencies  some 
of  the  loci  will  tend  toward  the  imaginary  axis— a generally  undesirable 
situation.  The, amount  of  motion  toward  the  imaginary  axis  is  reduced  by  the 
magnitude  attenuation  of  b at  higher  frequencies.  If  the  beam  has  no  inherent 
or  passively  supplied  damping  the  system  would  be  unstable.  Thus  as  a practical 
matter  the  actuator  must  have  a "flat"  frequency  response  up  to  a frequency  at 
which  the  beam  has  a high  inherent  damping  or  the  magnitude  of  b must  be  re- 
stricted by  considerations  of  effects  on  high  mode  eigenvalues. 
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Figure  1.  Block  Diagram  of  Feedback  Control  System 


Table  1.  Maximum  Damping  Ratio  and 
Corresponding  Value  of  b 
(Numerically  Determined) 

■^ode^m  ^rn,MAX  ^ 

1 0.289  0.378 

2 0.143  0.201 

3 0.0946  0.139 

4 0.0707  0.107 

5 0.0565  0.0866 

6 0.0471  0.0784 

10  0.0282  0.0441 


IMAGINARY 


Figure  3.  Eigenvalue  Placement 
Asymtotic  Values 


0(t) 


Figure  4.  Construction  of  Active  Beam  Damper 
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im 


Figure  2.  Root  Locus  for  Lower  Frequency  Modes 

(circles  on  loci  are  labelled  with  the 
values  of  b.) 
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ABSTRACT 


The  general  dynamics  of  a shuttle  supported  tethered  subsatellite  sys- 
tem taking  into  account  the  longitudinal  and  three  dimensional  transverse  vibra- 
tions is  considered.  It  is  noted  that  control  of  inherently  unstable  dynamics 
during  retrieval  of  the  subsatellite  can  be  carried  out  by  letting  the  rate  of 
change  of  length  depend  on  the  state  variables  in  an  appropriate  manner.  Cdntrol 
laws  using  linear  feedback  of  inplane  state  variables  and  nonlinear  feedback  of 
out-of-plane  state  variables  are  proposed. 


INTRODUCTION 


Tether  connected  multi  body  systems  have  various  potential  applications. 
A description  of  these  applications  is  given  in  a report^  by  the  Preliminary  de- 
sign Office  of  the  Marshall  Space  Flight  Center.  A shuttle  supported  tethered 
subsatellite  system  similar  to  the  Skyhook^  proposed  by  the  Smithsonian  Institu- 
tion for  low  altitude  scientific  experiments  is  being  considered  by  NASA.  To  en- 
sure successful  operation,  analysts  and  control  of  the  dynamics  of  the  system 
during  deployment,  stationkeeping  and  retrieval  stages  must  be  carried  out.  The 
general  dynamics  of  tethered  satellites  is  rather  complex.  The  degrees  of  free- 
dom involved  are: 
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a)  three  dimensional  rotational  motions,  of  the  endbodies; 

b)  three  dimensional  rotations  of  the  tether; 

c)  longitudinal  oscillations  of  the  tether;  and 

d)  three  dimensional  transverse  vibrations. 

The  major  environmental  effect  in  the  case  of  shuttle  supported  systems  is  that 
due  to  the  aerodynamic  drag.  A comparison  of  the  models  used  in  various  investi- 
gations of  the  shuttle  Supported  tethered  subsatellite  systems  has  been  carried 
out  by  Misra  and  Modi^. 

Control  of  the  dynamics  of  the  tethered  subsatellite  systems  is  likely 
to  be  carried  out  by  using  either  a tension  control  law  or  a length  rate  law 
i.e.,  by  letting  either  the  tension  in  the  tether  or  rate  of  change  of  its 
length  to  depend  on  the  state  variables  in  specific  ways?  Rupp**  formulated  a 
control  procedure  in  which  the  tension  was  proportional  to  the  actual  and  com- 
manded lengths  and  the  length  rate.  However,  only  the  rotation  of  the  tether  in 
the  orbital  plane  was  considered.  Baker  et  al^  and  Bainum  and  Kumar®  improved 
the  control  performance  by  choosing  a superior  set  of  gains.  A simulation  using 
a Rupp  type  control  during  deployment  and  retrieval  was  carried  out  by  Kulla^. 
Although  the  transverse  vibrations  of  the  tether  were  included,  the  longitudinal 
oscillations  and  the  out-of-plane  motion  of  the  system  were  ignored.  Kalaghan  et 
al®.  analyzed  the  Skyhook  system  in  detail,  taking  various  environmental  effects 
into  account.  The  tether  was  not  treated  as  a continuum, but  longitudinal  oscil- 
lations were  considered.  Although  References  7 and  8 incorporated  certain  vibra- 
tions in  the  dynamical  models,  the  control  laws  had  no  vibrational  feedback.  Du- 
ring retrieval  and  under  certain  circumstances  during  deployment  the  vibrations 
of  the  tether  increase  gradually,  indicating  that  appropriate  vibrational  feed- 
back must  be  used.  The  objective  of  this  paper  is  to  do  just  that. 


DESCRIPTION  OF  THE  SYSTEM 


The  Shuttle/Tethered  Subsatellite  System  is  shown  in  Figure  1.  The 
subsatellite  having  a mass  m is  supported  by  the  shuttle  through  a tether  hav- 
ing a mass  per  unit  length  and  instantaneous  nominal  length  L.  The  instan- 
taneous centre  of  mass  S of  the  system  may  be  assumed  to  coincide  with  the  cen-r 
tre  of  mass  of  the  shuttle,  since  the  mass  of  the  subsatellite  or  the  tether  is 
very  small  compared  to  that  of  the  shuttle.  S can  be  located  with  respect  to  the 
centre  of  the  Earth  E by  the  radial  distance  R , inclination  i of  the  orbit  to 
the  equatorial  plane,  argument  of  the  perigee  and  true  anomaly  6.  The  te- 
ther is  nominally  along  the  local  vertical.  The  instantaneous  orientation  of  the 
line  joining  the  two  ends  of  the  tether  is  defined  by  two  rotations,  roll  y and 
pitch  a,  in  that  order.  The  tether  is  stretched  longitudinally  due  to  the  gravity 
gradient  and  aerodynamic  drag.  This  longitudinal  strain  varies  along  the  tether, 
but  for  short  tethers,  this  variation  is  small.  Since  most  of  the  problems  in 
controlling  the  system  are  associated  with  short  tether  lengths  during  retrieval, 
it  is  assumed  that  the  longitudinal  strain  is  uniform  and  the  instantaneous 
stretched  length  L.  given  by 

^ L = L ( 1 ■+  e -)  > (1) 

/6 
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The  transverse  vibrations  u and  v perpendicular  to  and  in  the  plane  of  the  orbit, 
respectively,  are  superposed  on  the  stretched  tetherline.  These  transverse  vi- 
brations may  be  expanded  in  series  form  in  terms  of  a set  of  admissible  func- 
tions: 03 


U =.  E 

k=l 

(j)^(L,y)  A|^(t)  , 

(2a) 

00 

V = E 
k=l 

<l)k(L>y)  B|^(t)  , 

(2b) 

'l>|.(L,y ) 

= sin  (kuy/L)  . 

(2c) 

It  may  be  noted  that  ct)|^  is  not  invariant  with  time. 


The  equations  of  motion  corresponding  to  the  generalized  co-ordinates 
a,  Y>  Ej  A,  , B.  , k = 1,2, are  obtained  using  a Lagrangian  formulation. Line- 
arizing the  vi oratory  terms,  but  retaining  the  nonlinearities  in  rotations  which 
may  be  large,  one  gets  a set  of  nonlinear  equations  with  time  dependent  coeffi- 
cients. For  example,  the  equation  for  B|^  degree  of  freedom  is 


+ E [(B  -(JO  A )(L/L)(C  ,-C,.  ) - (L/L)2  D ,B  + (L/L)  C ,B  ] 
n yc  n^'  nk  kn^  \ ^ p v / y p|^  pj 


yc'  . zc  xc 


0 zc  xc 


yc 


where  y,  6(^,  0|^,  Cp^,  Dpi^,  (o|^,  cjOxg>  Wyg,  oizc>  ^xc’  %c»  ^zc>  defined  in 

Appendix  A.  Dtner  equations  are  also  given  in  Appendix  A.  The  generalized  forces 
are  due  to  the  atmospheric  drag.  The  equations  are  nondimensional ized  by  chan- 
ging the  independent  variable  from  time  f to  true  anomaly  0,  using  a new  set  of 
vibrational  variables 


Ak  “ Ak/L 

(4a) 

Bk  = Bk/L 

(4b) 

and  defining  a nondimensional  length 

n - 5<n(L/Lp0^)  , 

(4c) 

where  L^gf  is  an  arbitrary  reference  length.  Note  that  a, 
sional  to  start  with.  It  was  found  convenient  to  use  the 
ther  length  as  L^g^.i.Eq.S  can  now  be  rewritten  as 

Y,  and  e are  nondimen 
final  unstretched  te- 
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B|^  + (3n'-F)  B|^  + [co|+ri"+2ri'^-Fri'-(a'+CY)^-('Y'sa-SYca)^+G(l-3s^ac^Y)]  B|^ 

+ .^  C{Bj+n'Bj-(Y'sa-SYca)  ^jnCj(^-C|^j ) n’  - n'Dj|^Bj+(n"+n'^-Fri':) 

- sa-SYCOi)  A|^  - [Y"sa-2Y'cacY-(3G+l)saSYCY+(3ri'-F)(Y'sa-SYCa)]  A|^ 

- a|^[( l+e){a"-2Y' SYC^a-F(a'+CY)+Y' ^Saca+(3Gc^Y"S^Y)saca}+2(a'+CYf £* ] 

+ 26|^  n' 0+e)(a'+CY)  = Qj^j^  (5) 

where  F = 2 e s6  G 

G = (1+e  c0)-> 

= (w^/n^)  {(l-e^)V(l+e  c0)n 

= mean  orbital  rate 
e = eccentricity  of  the  orbit. 

Nondimensional  equations  corresponding  to  the  other  degrees  of  freedom  are 
omitted  here  for  brevity.  Defining  x as  the  state  vector  comprising  of  the  gene- 
ralized co-ordinates  and  their  derivatives,  the  equations  of  motion  can  be 
abbreviated  as 


A x'  = f . (6) 

If  N-j  and  Nq  are  the  number  of  admissible  functions  retained  in  the  modelling  of 
inplane  and  out-of-plane  transverse  vibrations,  respectively,  then  there  would 
be  2(3+N:|+Nq)  elements  in  x.  Note  that  A is  a square  matrix  dependent  on  x as 
well  as  dimensionless  time~0  and  f is  a vector  of  rather  lengthy  nonlinear~ 
functions  of  x and  0, 


DYNAMICS  AND  CONTROL  DURING  RETRIEVAL  OF  THE  SUBSATELLITE 


Deployment  of  the  tethered  subsatellite  to  the  required  altitude  and 
its  subsequent  retrieval  can  be  carried  out  by  either  providing  appropriate  ten- 
sion in  the  tether  or  controlling  the  rotation  of  the  tether  feedout  spool, 
thereby  controlling  the  rate  of  change  of  the  undeformed  length  of  the  tether. 
The  second  approach  is  followed  here,  i.e.,  is  specified. 

Deployment  is  basically  a stable  operation^.  Hence,  attention  is  fo- 
cused on  the  dynamics  and  control  during  retrieval.  Figure  2 shows  the  out-of- 
plane  portion  of  the  uncontrolled  retrieval  dynamics  using 

L = CL  (7) 
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where  c = -2x10"“*  sec"^  ("c  = cxlO"*  = -2).  The  shuttle  is  in  a polar  orbit  with 
its  major  axis  220  km  larger  than  the  earth's  radius  and  having  an  eccentricity 
0.001.  The  mass  of  the  subsatellite  is  150  kg  and  that  of  the  tether  per  unit 
length  is  1.5  kg/km.  One  may  note  the  growth  of  roll  and  transverse  vibrations. 
[Numerical  integration  was  terminated  when  the  displacements  became  too  large. 

Clearly,  some  sort  of  control  is  required  to  eliminate  the  dynamical 
instabi 1 i ty  associated  with  retrieval  of  the  subsatellite.  It  must  be  pointed 
out  tliat  it  is  not  necessary  to  have  stringent  shape  and  attitude  control  during 
retrieval.  The  objective  of  a control  procedure  is  to  arrest  the  unlimited 
growth  of  the  displacements  and  bound  them  to  reasonable  limits.  For  nonlinear 
systems  having  time  varying  coefficients,  it  is  difficult  to  devise  such  a con- 
trol procedure.  Hence,  to  gain  some  insight  linearized  system  equations  valid 
for  zero  eccentricity  are  examined,  although  subsequent  numerical  simulation  is 
carried  out  without  any  approximation.  Considering  A.  and  B.  degrees  of  free- 
dom one  has 


\"+3n'A,;+(r,"H-2n'M+:^)  \ + f [nVCjk-Cfcj)Aj+{n''Cjk+n'f2Cjk-Ckj-Dy))f;j] 

J 

- 0'k(Y"+4Y)  - 26^ti'y‘  = Qak 

I 2 ^2  ^ 00  ,,  ,2 

B|^  + 3n'Bk  +(n"+2n'  +w[<)B|<  [n'(Cji<-C|<j)Bj+{n  Cjk+n  (2Cjk-Ckj-Dkj)}Bj  ] 

-3  ^ 

- o'|^(a"+2e ' +3a)  + 26|^n ' (a'+e)  = Q[^k"26|<n'  (Bb) 


and 

e"  + 2pin'e'  +[co£+y^(n"+n'  )]e  - 2]j^  Oj(Bj+3q'Bj)  - 2a' 

J X 

where  = (m5+%pj.L)/(m5+V3  PtL) 


= Q^.  + 3 + yi(n"+n'^)  > 
(8c) 


and  P2  = ptL/(ms+V3  PtL) 


Similar  equations  for  a and  y omitted  here  for  brevity.  The  presence  of  the 
terms  26^11'  and  q"  in  the  right  hand  side  of  Eqs.  (8b)  and  (8c)  ,respecti vely 
suggests  that  inplane  transverse  vibrations  and  oscillations  may  be  reduced  by 
letting  the  length  rate  q'  depend  linearly  on  transverse  vibrational  rate  and 
longitudinal  strain.  However,  no  such  term  is  present  for  out-of-plane  vibra- 
tions and  a nonlinear  control  law  in  conjunction  with  nonlinear  equations  must 
be  considered  for  that  case.  This  conclusion  of  linear  state  feedback  of  out- 
of-plane  vibrations  is  parallel  to  the  case  of  rotational  control  for  which  it 
has  been  shown®that  nonlinear  equations  of  motion  must  be  considered  and  an  ap- 
propriate control  law  is  2 

q'  = Kq  [l+K^a'+K^Y'  ] , (9) 

where  and  Ky  are  negative  constants  and  K9  is,  in  general,  a negative  func- 
tion of  6 defining  the  commanded  or  nominal  retrieval  rate. 

It  may  be  noted  that  'modal  decoupling'  is  possible  only  for  a cons- 
tant length  tether  (i.e.,  q'-  0),  but  not  during  retrieval.  This  is  due  to  the 
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fact  that  there  are  no  conventional  normal  modes  for  a tether  having  a time  va- 
rying length  and  the  modal  co-ordinates  (loosely  termed)  are  coeffi cents  of  ad- 
missible functions  which  are  time  dependent.  This  coupling  turns  out  to  be  use- 
ful due  to  the  following:  6|<;  and  hence  26|<.n'  is  zero  for  an  even  k and  without 
coupling  vibrational  feedback  to  n'  would  have  no  effect  (except  of  second  or- 
der) on  even  modes. 

The  length  rate  chosen  to  control  the  general  dynamics  is  of  similar 
form  as  Eq.(9)  and  is  given  by  t t 

n'  = Kq  [l+K-x.+K‘g(xQ)]  (10) 

where  x.j  and  Xj,  are  the  inplane  and  out-of-plane  portion  of  the  state  vector,  K.j 
and  !<Q~are  a set  of  gain  vectors  and  g is  a nonlinear  vector  function  of  x^. 
There  are  no  standard  procedures  to  determine  and  Kg.  Here, they  are  chosen  so 
that  in  approximated  forms  of  Eq(8),  the  negatTve  damping  due  to  negative  n'  is 
eliminated. 

The  pitch  and  roll  behaviour  of  the  system  during  retrieval  of  the 
subsatellite  using 

n ' = £(  l-a'  -9y'  ) , c = -2xl0”‘‘sec"’-  (11) 

JL 

is  shown  to  be  stable  limit  cycles  in  Reference  9.  The  vibrational  behaviour 
using  the  same  length  rate  is  shown  in  Figure  3.  The  longitudinal  oscillation 
grows  rather  fast  and  the  transverse  vibrations  are  building  up  slowly.  Numeric 
cal  integration  was  stopped  when  the  longitudinal  oscillation  became  too  large. 
Figure  4 shows  the  response  of  the  system  when  the  length  rate  is: 

n = £(1-y'-9y'^-K££)  , K^=30,  c=-2xl0"'*sec'\'(12) 

A 

Axial  vibrations  are  more  or  less  eliminated.  The  transverse  displacements  repre- 
sented by  Aj,  A^,  Bj,  and  Ba  and  the  resultant  transverse  deflection  of  the  mid- 
point of  the  tether  denoted  by  R |1 so^are^smal 1 towards  the  end  of  retrieval .The 
dimensionless  modal  co-ordinates  A^,  B , and  B^  are  bounded  within  reasona-. 
ble  limits,  although  they  have  grown  during  retrieval.  Using  a length  rate 

n'  = £(l-a'-9Y'^-K^  B‘)  , (13) 

A 

with  K.  =.10  and  employing  a viscous  damper  to  provide  a force  proportional  to 
the  longitudinal  strain  rate^one  obtains  the  system  response  shown  in  Figure  5. 
This  response  is  very  similar  to  that  in  Figure  4,  except  that  the  transverse 
vibrations  are  slightly  smaller.  The  retrieval  dynamics  up  to  a tether  length 
of  250  m using  Eq.(12)  is  described  in  Figure  6.  K^,  is  30  for  L>1.2km,  but  is 
300  for  L<1.2km.  It  may  be  noted  that  the  vibrations  remain  small  up  to  the  end 
of  retrieval. 


CONCLUDING  REMARKS 

The  salient  features  of  the  analysis  may  be  summarized  as  follows: 

(i)  Control  of  longitudinal  and  transverse  vibrations  of  a shuttle  supported 
tethered  system  during  its  retrieval  can  be  carried  out  by  letting  the 
length  rate  depend  on  the  vibrational  state  variables. 

(ii)  Linear  feedback  of  inplane  state  variables  and  nonlinear  feedback  of  out- 
of-plane  state  variables  are  used. 

(iii)  Several  control  procedures  with  reasonable  success  are  proposed. 
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(iv)  It  is  suggested  that  some  effort  should  be  directed  towards  developing  sui- 
table procedures  for  control  of  flexible  systems  where  nonlinearities  and  time 
dependence  of  system  parameters  cannot  be  ignored. 


REFERENCES 


1.  Preliminary  Design  Office,  "Shuttle/Tethered  Satellite  System  Conceptual  De- 
sign Study”,  NASA  TM  x-73365,  Marshall  Space  Flight  center,  December  1976- 

2.  Colombo,  G.  et  al . , "Shuttle-Borne  Skyhook",  Smithsonian  Astroph.ysical  Obser- 
vatory Report  in  Geoastronomy  No.l,  September  1974. 

3.  Misra,  A.K.  and  Modi,  V.J.,  "Deployment  and  Retrieval  of  Shuttle  Supported 
Tethered  Satellites",  Journal  of  Guidance,  Control  and  Dynamics^  Vol .5, No. 3, 
May-June  1982,  pp. 278-285. 

4.  Rupp,  C.C.,  "A  Tether  Tension  Control  Law  for  Tethered  Subsatellites  Deployed 
Along  Local  Vertical",  NASA  TM  x-64963,  Marshall  Space  Flight  Center, Septem- 
ber 1975. 

5.  Baker,  W.P.  et. al ., "Tethered  Subsatellite  Study".  NASA  TM  x-73314  , Marshall 
Space  Flight  Center,  March  1976. 

6.  Bainum,  P.M.  and  Kumar,  V.K.,  "Optimal  Control  of  the  Shuttle-Tethered-Sub- 
satellite  Study",  Acta  Astronaut! ca,  Vol . 7,  No. 12,  1980, , pp . 1 333-1 348. 

7.  Kulla,  P.,  "Dynamics  of  Tethered  Subsatellites",  Proceedings  of  the  Symposium 
on  Dynamics  and  Control  of  Non-Rigid  Spacecraft",  Frascati,  Italy,  May  1976, 
pp. 349-354. 

8.  Kalaghan,  P.M.,  et  al . , "Study  of  the  Dynamics  of  a Tethered  Satellite  System 
(Skyhook)",  Final  Report,  Contract  NAS8- 32199,  Smithsonian  Institution,  As- 
trophysical  Observatory,  Cambridge,  Mass,,  March  1978. 

9.  Xu,  D.M.,  Misra,  A.K.,  and  Modi,  V.J.,  "Three  Dimensional  Control  of  the 
Shuttle  Supported  Tethered  Satellite  Systems  during  Retrieval",  3rd  VPISU  / 
AlAA  Symposium  on  Dynamics  and  Control  of  Large  Flexible  Spacecraft,  Blacks- 
burg, Va. , June  1981 . 


APPENDIX  A 


Equations  for  A|^  and  e degrees  of  freedom  are  as  follows: 

. » • • 00  • • 

Ak  +(L/L)A|^  +[w^-u)J^-a)^^+(p/R^)(l-3;i^^)]A|^  ^ ^ 

-(L/L)  D j|^Aj+( L/L ) C j i^Aj ] + Bcoy^Bi^  ‘^'(^yQ’*'^xc^zc''^^L^'^^o^^xc^zc^^^/^^^yc^^k 
-[a|^L{o)2c-Wxcmy(-+(3p/RQ)£xc^yc^(  *^bk/Pt*-  (A-1) 
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e + 2Uj^(L/L)e  +[u)g+y^  (L/L)  ]e  -[a)xc+w|c"(vi/Ro)  ( -y(L/L)] 

oo  • > • • 

-2(li2/L)  5 cf|^[wxc{B|^+2(L/L)B|^}  - o)2c{A|^+2(L/L)A|<;}] 

^ ’ 3 ^ 

"(y2/L)[Wxc'“yc‘^zc“^^’^/'^o)^yc^zc^i,L'^k^k  " Qe/^  (ftls+Va  f^L) 

K"“  i 


where 


y = gravitational  constant  of  the  earth 


('ii5+^Pt^)/('^j-'^V3  P|-L) 

y^=  p^L/lm^^+Vs  P^L) 

0|^=  /2  (-l)^“Vk77 

6|^=  2v^ /kTT  for  odd  k,  0 for  even  k 

j ^ k 
j = k 


^jk" 


D-iu- 


'jk 


2jk/(k^-j^)  , j ^ k 
% '‘  > j “ k 
4jk(j^+k^)/(j^-K^)^  , 

% + k^TT^/3  , 


0)2  = {k7r/L)2(EAE/p^) 

0)2  = EA/(m^+V3PtL)L 

0)xc=  « + 9cy 

o)y(.=  ysa  - Ssyca 

0)^^=  yea  + Bsysot 

£ = sr 
xc 

cacy 
SL^^=  -sacy 


Equations  for  a and  y are  given  in  Reference  9. 


(A-2) 


324 


Figure. 1.  Geometry  of  motion. 


Figure  3. 


Vibrational  response  in  the 
presence  of  rotational  control 


orbits 

Figure  4.  Response  of  the  system  using 
a length  rate 

n'=c(l-a'-9Y'  -K^e),  K^=  30 
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ORBITING  CHAINS  AND  RINGS 

John  V.  Breakwell 
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Stanford,  CA  94305 


ABSTRACT 


This  paper  summarizes  some  recently  published  analyses  of  the  hanging  chain  and 
the  orbiting  ring. 


1.  INTRODUCTION 


This  paper  describes  two  very  different,  highly  flexible,  space  structures  which  have 
been  proposed  during  the  last  decade  and  which  involve  analytical  solution  of  certain  par- 
tial differential  equations. 

The  flrst  structure  is  an  array,  or  "hanging  chain"  of  aluminum  beads  [1]  which  would 
serve  as  a convenient  communicator  (see  Fig,  l)  if,  under  the  influence  of  the  earth’s  grav- 
ity gradient,  it  assumes  a local  vertical  orientation  as  it  circles  the  earth.  The  analysis  for 
small  deviations  from  vertical  uses  a continuous-chain  model,  and  pitch  and  roll  frequen- 
cies and  mode  shapes  are  readily  obtained.  Except  for  the  lower,  rigid,  modes,  damping  is 
provided  by  fluid  in  the  joints  (see  Fig.  2)  between  the  aluminum  beads.  Two  passive 
schemes  have  been  proposed  for  damping  the  rigid  modes:  (i)  twist  the  wire  at  the  ends  to 
provide  non-zero  moment  of  inertia  about  the  vertical  [l],  thereby  inducing  relative 
motion  of  the  two  tips  during  rigid  pitch  or  roll,  and  thus  exercising  a damper;  (ii)  intro- 
duce weak  lossy  springs  [2]  between  the  end  sections  and  the  main  section;  thereby  pro- 
viding; linear  coupling  between  the  springs  and  all  the  in-plane  (pitch)  modes.  Damping  of 
the  rigid  roll  is  achieved  by  tuning  the  spring  constant  so  that  one  of  the  in-plane  combi- 
nation modes  has  frequency  4 times  orbital  frequency.  Nonlinear  coupling  then  provides 
slow  energy  transfer  back  and  forth  between  rigid  roll  and  the  particular  in-plane  combi- 
nation mode,  and  hence,  eventually,  damping  of  the  rigid  roll, 

The  second  structure,  for  some  future  generation(l),  is  a complete  ring  of  satellites 
cabled  together  at  synchronous  altitude  [3],  By  a slight  increase  in  altitude,  the  cable  is 
in  tension.  This  configuration  is,  however,  unstable,  and  an  active  feedback  control 
scheme  is  required  to  stabilize  it.  A possible  scheme  [3]  involves  local  cable  length  adjust- 
ment based  on  measurement  of  local  altitude  and  shape  variations  and  their  rates. 
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2.  THE  HANGING  CHAIN 


For  launch,  the  array  is  coiled  in  a container.  Deployment  consists  in  driving  the 
array  out  of  the  container,  one  end  first.  If  the  scattering  elements  were  supported  by  a 
wire,  the  array  would  retain  an  unacceptable  residual  curvature.  A freely  jointed,  bead- 
chain  construction,  on  the  other  hand,  would  rely  on  the  rather  weak  gravity  gradient  at 
synchronous  altitude  for  straightening.  Some  inaccuracy,  however,  in  deployment  could 
result  in  an  initially  tumbling  motion,  and  as  this  tumbling  motion  is  slowed  by  damping 
(see  later),  there  will  be  periods  of  compression  rather  than  tension  in  the  chain  (see  Fig. 
3),  which  could  result  in  the  chain  tying  itself  into  knots.  A happy  compromise  consists  in 
a support  structure  of  about  100  sections  of  relatively  stiff  wire  connected  by  joints  that 
are  free  up  to  about  3“  of  motion  (see  Fig.  2).  Residual  curvature  of  the  sections  is  now 
tolerable,  and  knotting  is  impossibie.  Gravity  gradient  and  damping  can  be  relied  upon  to 
align  and  straighten  the  array. 

Modeling  the  chain  as  a perfectly  flexible  cable  of  length  L and  linear  density  a , the 
tension  in  a vertical  equilibrium  position  is: 


(1) 


where  n is  the  (constant)  orbital  angular  rate. 

Adopting  the  coordinate  system  of  Fig.  4 and  a dimensionless  independent  variable, 
f = 2x/ L , the  partial  differential  equations  of  motion  become; 


(1-f) 


dy  _ 


3 dhj 
3n®  dt^ 


(2) 


(1-n 


d^z  dz 

^ di 


Z d^z 
3n^  dt^ 


The  mode-shapes  are  Legendre  polynomials: 


yiiO  = MO  = APiiO 


and  the  frequencies  are  given  by: 


(Uy/n)^  = {ug/n)^-l  = |-f(I  + l) 


(3) 
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Damping  in  the  joints  may  be  represented  by  small  additions  to  the  equations  of 
motion.  In-plane,  for  example: 


^ 
3 n 


z 


y 


= j-nym 


(4) 


where  r)  is  & small  dimensionless  damping  parameter,  and  the  rate  of  energy  loss  in  the 
mode  is  approximately 


providing  damping  of  all  but  the  rigid  mode:  Z = 1 . 

The  first  passive  scheme  proposed  [l]  for  damping  the  rigid  pitch  and  roll  modes  is 
illustrated  in  Figs.  5,  6 and  7.  For  a chain  of  length  150  m and  "tip  inertias"  of  radius  5 m, 

it  was  estimated  that  in-plane  libration  could  be  decreased  by  a factor  2 in  5 —days,  and 

out-of-plane  libration  a little  faster. 

The  second  passive  scheme  requires  the  tuning  of  springs  inserted  between  the  end 
portions  of  the  chain  and  the  main  portion.  The  spring  constants  must  be  adjusted  so  that 
one  of  the  odd  in-plane  combination  modes  has  frequency  at  or  close  to  twice  that  of  rigid 
roll,  i.e.,  -Cnmes  orbital  frequency  n.  This  provides  for  a slow  interchange  of  energy 
between  the  rigid  roll  and  the  particular  in-plane  mode.  The  necessity  for  choosing  an  odd 
rather  than  an  even  in-plane  mode.  The  necessity  for  choosing  an  odd  rather  than  an  even 
in-plane  mode  arises  from  the  requirement  that  the  two  springs  act  together  rather  than 
in  opposition.  It  was  estimated  that,  if  the  time-constant  of  the  in- plane  mode  was  10 
days,  due  to  energy  loss  either  in  the  spring  or  in  the  cable  joints,  a 2 1/2%  mismatch  to 
the  frequency  4n  could  still  reduce  a rigid  roll  amplitude  from  30°  (unacceptable)  to  4° 
(acceptable?)  in  about  60  days. 


3.  THE  ORBITING  RING 


Treating  the  ring  [3]  as  a continuous,  perfectly  flexible,  chain  with  longitudinal  elasti- 
city in  circular  orbit  at  radius  R about  a spherical  planet,  the  equilibrium  tension  is: 


= aR{n^R  - fx/ R^) 


(5) 


proportional  to  the  difference  between  centrifugal  and  gravitational  acceleration.  The  ten- 
sion in  a displaced  position  is: 
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T = T,+k  T,  +*(x+3/„)  . (6) 

where  a is  the  angular  position  of  an  element  of  the  ring.  Introducing  two  non- 
dimensional  parameters: 

w®  = T^/{n^aR^)  = l-;u/(n®/?3)  , (7) 

typically  much  less  than  unity,  and 

cjf  = k/{n^aR)  . (8) 

typically  much  greater  than  unity,  and  the  non-dimensional  time: 

T = nt  , (9) 

the  equations  of  motion  are: 

?/TT-2l/T-(3-2w^)2:-«2(x„„-y„)+«f(x+y„)  = 0 

y„  + 2Xr  - - oK^a  + Vaa)  = 0 

Ztt  + (1  - - O^Zaa  = 0 

Modal  solutions  have  x,y,z  proportional  to  m being  an  integer,  yield- 

ing the  following  characteristic  equations: 

~[  1 +(m^+2)u^+  (m^+  l)«f  ]Q^  +4771  )nm 

- 3wf)  = 0 , (10) 

arising  from  the  in-plane  coupled  x,y  equations,  and 
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- (l-w®)  = 0 


(11) 


arising  from  the  out-of-plane  z equation. 

The  ring  is  unstable  unless  both  characteristic  equations  lead  to  real,  distinct,  values 
of  Qto  . for  each  m > 0 . The  out-of-plane  motion  is  thus  clearly  linearly  stable, 


In  the  limiting  case  (uf  = “)  of  the  inextensible  ring,  the  in-plane  characteristic 

— g — , leading  to  instability,  corresponding  in 


fact  [3]  to  a growing  rigid  translation  out  of  position.  Numerical  investigation  of  the  exten- 
sible case,  , revealed  that,  although  the  mode  m = 1 could  become  stable  in  the 

unlikely  case  that  af  were  sufficiently  small  in  comparison  to  , some  other  mode 
would  always  go  unstable. 


Stability  for  m = 1 is,  moreover,  impossible  when  elastic  damping  is  taken  into 
account.  To  see  this,  we  replace  h by  k{l-¥^jQi)  . where  /3  is  real  and  positive,  and 
hence  w®  by  j / ^)  , where  K is  the  undamped  value  of  wf  . A typical  root-locus 
vs  K , for  fixed  /3  greater  than  zero,  is  sketched  in  Fig.  8.  Since  one  branch  immediately 
enters  the  lower  half  flx-plane,  instability  occurs  for  all  K > 0 . 


It  has  been  shown  [3]  that,  at  least  for  of  » 1 and  o®  « 1 , the  following  control  law 
stabilizes  all  modes: 


T = Tg  + k{x  -i-ya-u) 


(13) 


u being  a local  cable  length  change,  given  by: 


u 


3-gcj^ 

AT(l+/?s) 


X + 3x„„ 


As 

K{l+ps) 


aa 


AT(l+/3s) 


1 1 3 


(13) 


where  the  Laplace  variable  s indicates  differentiation  w.r.t.  time.  Indeed,  the  resulting 
characteristic  equation  takes  the  form: 


A4(nm)-jV(n^)  = 0 


(14) 


where 
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Ag*(n;„)  s ^Aa(n^)  + /r/SAa(n„) 


(15) 


and  where  the  polynomials  Ai(Qm)  have  zeros  as  indicated  in  Fig,  9.  Since  the  zeros  of 
. like  those  of  Ag(n^)  , always  separate  those  of  A4(Q^)  , the  root-locus  vs  A has 
all  branches  entering  the  upper  half  -plane  as  soon  as  A > 0 . and  some  damping  of  all 
modes  is  assured. 

It  is  interesting  to  note  that  the  system  is  not  completely  controllable:  an  initial  error 
in  total  ring  angular  momentum  cannot  be  removed.  Nevertheless,  the  control  law  (13), 
applied  to  the  "uniform"  mode  m = 0 , removes  the  angular  error  y of  all  points  of  the 
ring  by  adjusting  the  cable  length  and  hence  the  radial  increment  x , 
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TYPICAL  SECTION  OF  ARRAY 


Fig.  1 Configuration  of  feasibility  Fig.  2 Array  construction 

demons  trat ion , 
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Fig.  3 Phase-plane  plot  for  the 

in-plane  motion  of  a rigid  Fig,  4 Coordinate  system. 

rod . 
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Fig.  5 Array  of  inertial  tips. 


Fig.  6 Dynamic  model  of  tip  inertia. 


Fig.  7 Libration  of  tip  inertia 
(1/2  cycle) . 


Fig.  9 Zerosof  Aj  and 
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ABSTRACT 


Adaptive  control  techniques  are  being  studied  for  their  future  application 
to  the  control  of  large  space  structures,  where  uncertain  or  changing  para- 
meters may  destabilize  standard  control  system  designs.  The  approach  used  in 
this  paper  is  to  examine  an  extended  Kalman  filter  estimator,  in  which  the 
state  vector  is  augmented  with  the  unknown  parameters.  The  associated  Ricatti 
equation  is  linearized  about  the  case  of  exact  knowledge  of  the  parameters. 

By  assuming  that  parameter  variations  occur  slowly,  the  filter  complexity  is 
reduced  further  yet.  Simulations  on  a two  degree-of -freedom  oscillator 
demonstrate  the  parameter -tracking  capability  of  the  filter,  and  an  implemen- 
tation on  the  JPL  Flexible  Beam  Facility  using  an  incorrect  model  shows  the 
adaptive  f ilter/optimal  control  to  be  stable  where  a standard  Kalman  filter/ 
optimal  control  design  is  unstable. 

INTRODUCTION 


This  paper  contains  the  derivation  of  a simplified,  state  space  adaptive 
f liter /controller , which  can  be  used  in  real  time  to  adapt  to  changing  or 
uncertain  dynamic  models.  The  application  of  the  simplified  extended  Kalman 
filter  approach  to  a two  degree-of -freedom  oscillator  is  compared  to  the  more 
complex  complete  extended  Kalman  filter.  It  will  be  seen  that  the  simplified 
filter  compares  quite  well  with  the  full  order  filter.  In  addition,  the 
simplified  filter  has  been  implemented  on  an  experimental  facility  consisting 
of  a large,  flexible  hanging  beam.  Experiments  on  the  facility  show  that  the 
adaptive  filter/optimal  control  can  stabilize  a system  whereas  the  Kalman 
f ilter/optimal  control  destabilizes  it,  if  the  system  model  is  incorrect. 


As  usual,  the  problem  formulation  begins  with  a dynamic  system  in  state 
variable  format 

X = Fx  + Gu  4-  Tw 

(1) 

z = Hx  + V 


In  the  case  that  the  dynamic  system  in  (1)  is  precisely  known,  a state 
estimator  of  the  following  form  may  be  constructed. 


X = Fx  + Gu  + K(z  - Hx) 


(2) 
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Optimal  estimator  gains  K may  be  selected  minimizing  the  trace  of  the 
estimate  error  covariance,  P = E(xjo^).  Letting  x = x-x,  differencing  (1)  and 
(2)  gives 

* 

X = (F  - KH)x  + Pw  - Kv  (3) 

then 

• 

P = E(xx'^)  + E(xx^)  = (F-KH)P  + P(F-KH)^  + rQP^  + KRK^  (4) 

Picking  K to  minimize  the  trace  of  the  estimate  error  covariance  gives  the 
usual  result 

K = Ph\~^  and  P = FP  + PF^  + rQp’^  - Ph\"^HP  (5) 

Notice  that  for  the  case  of  a precisely  known  dynamic  system,  the  estimator 
gain  may  be  precomputed,  even  in  the  event  of  a time  varying  system.  The 
analysis  used  in  the  case  of  unknown  system  parameters  closely  parallels  the 
preceding  development. 

Adaptive  Control  Formulation 

Adaptive  control  may  be  required  in  the  case  that  the  model  in  (1)  is 
unknown,  uncertain,  or  dependent  upon  the  system  configuration.  The  modifica- 
tions that  need  to  be  made  in  (1)  in  order  to  include  the  effects  of  an  uncertain 
parameter  are  given  below 

X = F(a)x  + G (a)u  + Pw 

a = w (6) 

cL 

z = Hx  + V 

As  can  be  seen  from  (6) , the  system  dynamics  are  now  a function  of  the 
vector  of  parameters,  a.  Furthermore,  these  parameters  are  assumed  to  be 
randomly  varying  constants,  so  that  they  may  be  adjoined  to  the  state  vector. 

An  adaptive  state  estimator  may  now  be  written  as 

• 

X = F(a)x  + G(a)u  + K (z  - Hx) 

t . (7) 

a = K (z  - Hx) 

a 

Now,  both  the  state  vector  and  the  vector  of  parameters  is  updated  using  the 
measurements.  Determination  of  the  adaptive  gains  and  K still  remains. 

3. 
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In  addition  to  the  increased  dimension  of  (9)  over  (5) , the  feature  that 
distinguishes  these  two  cases  is  the  appearance  of  x term  in  (9) . x is  the 
estimate  of  the  state,  and  therefore  cannot  be  known  a priori.  Therefore, 
both  the  state  estimator  and  the  augmented  order  matrix  Riccati  equation  must 
be  solved  in  real  time  in  order  to  implement  the  extended  Kalman  filter.  On 
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line  solution  of  the  matrix  differential  equation  tremendously  increases  the 
required  computation.  Several  approximations  will  be  made  in  an  attempt  to 
reduce  this  computational  complexity. 

Extended  Kalman  Filter  Approximation  (EKF) 


In  the  case  when  there  are  no  uncertain  parameters  (and,  of  course,  under 
suitable  observability  conditions)  , the  estimator  gains  tend  to  a steady  state 
value.  A reasonable  choice  for  simplifying  the  EKF  is  to  linearize  the  solution 
about  the  case  of  perfectly  known  parameters.  Toward  this  end,  it  will  be 
assumed  that  P = Pq  + ePq  + , where  Pq  is  a constant  matrix  and  t is 

a small  parameter,  and  that  the  process  noise  driving  the  parameters  has  a 
spectral  density  of  order  . In  the  limiting  case,  for  e = 0,  the  solution 
reduces  to  the  familiar  steady  state  gains  for  a perfectly  known  system. 

Letting 


F X = 
a 


" 8G 
X + — u 

g.  , 

a 


expanding  (9)  and  collecting  like  powers  of  e gives  the  following  intermediate 
results : 

zeroth  order 


0 « FP  + P + F X P + P ^ (F  x)^  + rqr'^  - P H^r"^HP  (lOa) 

XX  aaxax  a ^ x x 

o o o o o o 


T T T — 1 

0 = P F^  + P (F  x)  - P HR  HP 
ax  a a ax  x 

o o o o 


0 = P HR 

ax  ax 

o o 


(10b) 

(10c) 


first  order 


“ FP  + + F,xP  + P_  ^(F  x)^  - P h\“^HP  " 


Xp  x^  x^ 


a ax^  axj^  a 


X 


^ - P H"r  -"HP  (11a) 

x^  X,  X 

lx  o 


aXj^  aXj^ 


- P^^  h\“^„  - p H^r"V 


ai  a 


ax 


x^  ax^ 


T -1 


-P  H‘R  ^HP  ^ - P hV^ 

^1  axo  axj^  ax^ 


(llb) 

(llc) 


ax 
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From  (10c),  Pax  Using  this  result  in  (10b)  gives  ?a  = 0, 

Using  both  these  results  in  (10a)  gives  the  familiar  steady  state  results  for 
a plant  with  known  parameters,  as  was  expected.  The  results  obtained  from  (10) 
can  now  be  used  to  simplify  (11).  A summary  of  the  simplified  results  is 
given  below: 


zeroth  order 


0 = FP  + P f’’  + rqr^  - p^  h’^r“^hp 

^o  ^o  0 O 


0 = P 


(12a) 

(12b) 


ax 


0 = P 


(12c) 


where 


first  order 


P = (F-K  H)  P + P (F-K  H)^  + (F  x)F  +P 
X,  ^ x-|^  a axj^  ax^  a 


P = P (F-K  H)"^  + P (F^x)"^ 


a. 


0 . 


(13a) 

(13b) 

(13c) 


K 


= P 


H R 


-1 


EX^en  after  simplification,  (12)  and  (13)  require  an  on  line  integration 
of  a matrix  equation  in  order  to  generate  the  adaptive  gains.  A further 
simplification  that  can  be  made  (and  later  be  verified  with  simulation)  is  to 
assume  the  derivatives  on  the  left  hand  side  of  (13)  are  identically  zero. 

This  allows  for  computation  of  the  adaptive  gains  in  terms  of  the  current  value 
of  the  state  using  a simple  matrix  multiplication,  rather  than  a matrix 
integration.  The  final  results  of  this  simplification  are  shown  in  (14) 
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0 

*0 


FP  + p - p^  h'^r"^ 

o 

P (F  - K H)^  + P„ 

\ ® 


(14) 


Adaptive  Control  Simulation 

The  extended  Kalman  Filter  (EKF)  and  the  approximation  to  the  EKF  derived 
in  the  previous  simulation  are  compared  using  a simplified  model  for  the 
computer  simulation.  The  model  used  is  shown  below. 


}— >Wi 


The  equations  of  motion  are; 

• • (»  • 

Mx^  = - Kx^  + kCx^  - x^)  + b(x2  - x^)  + w^ 

mx^  = - kCx^  - x^)  - b(x2  - x^^) 


(15) 


The  simulation  consists  of  modeling  the  system  shown  in  the  previous 
schematic  by  simply: 
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At  t ==  2 on  the  computer  generated  charts  shown  in  Figures  1-6,  a step  change 
in  the  natural  frequency  from  1.0  to  1.25;  i.e.  at  25%  error  in  natural 
frequency  is  made.  The  adaptive  filters  are  supposed  to  detect  this  error 
in  the  presence  of  process  noise  W]^,  a truncated  mode,  and  measurement  noise 
in  the  position  of  M.  Virtually  indistinguishable  results  are  obtained  for 
the  estimator  using  the  full  EKF , vs.  the  estimator  using  the  simplified  EKF. 
On  the  computer  charts,  true  values  are  shown  by  solid  lines,  and  dashed  lines 
show  the  corresponding  estimated  value. 
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The  JPL  Flexible  Beam  Facility 

A facility  has  been  constructed  to  experimentally  validate  a variety  of 
control  technologies  associated  with  large,  flexible  space  structures  and  to 
gain  insight  into  their  implementation.  To  date,  Kalman  filter /optimal  control, 
distributed  control,  and  static  figure  control  have  been  demonstrated  using  the 
Flexible  Beam. 
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The  facility  consists  of  a 12-1/2  foot  long,  by  6 inch  wide,  by  1/32  inch 
thick,  hanging  stainless  steel  beam.  A large  tower  was  constructed  for  the 
sole  purpose  of  supporting  the  beam  and  isolating  it  from  vibrations  in  the 
ground.  Sensing  with  a range  of  + 1 inch  is  performed  by  up  to  four  eddy 
current  sensors,  and  actuation  is  by  three  D.C.  brushless  motors  capable  of 
outputting  5 ounces  to  the  beam. 

All  control  functions  are  handled  by  a SYM  microcomputer , which  uses  a 
6502  microprocessor  operating  at  1 megahertz.  Four  12-bit  digltal-to-analog 
converters  on  two  circuit  boards  allow  interfacing  between  the  microcomputer 
and  the  sensors  and  actuators.  In  addition,  a 9511  floating-point  arithmetic 
processor  performs  the  mathematical  computations  associated  with  digital 
filtering.  Previously,  computations  were  performed  in  software  using  two-byte 
fixed  point  precision;  the  9511  executes  four-byte  floating  point  precision 
calculations  in  approximately  1/5  the  time,  allowing  the  implementation  of  more 
complesc  control  laws  . 

Additional  features  of  the  facility  include  an  ASCII  keyboard,  a video 
monlter  , and  a small  printer.  The  microcomputer  features  an  advanced  moniter , 
for  machine  language  debugging;  an  editor -assembler , for  program  development; 
and  BASIC,  which  provides  the  interface  between  the  user  and  machine  language 
programs.  Commonly  used  routines  are  stored  on  either  erasable,  programmable 
read-only  mem.orles  (EPROMS),  or  digitally  on  inexpensive  audio  cassettes. 
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Figure  7.  The  JPL  Flexible  Beam  Facility 
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A general  purpose  machine  - language  program  has  been  written  for  the 
purpose  of  implementing  control  algorithms.  Due  to  the  predominance  of 
matrices  in  digital  control  design,  the  program  treats  matrices  as  the  primary 
data  type.  Single  instructions  allow  matrices  to  be  multiplied,  added,  stored, 
or  recalled  from  memory.  Additional  instructions  sample  sensors,  output 
controls,  or  set  a timer  (to  maintain  a constant  control  loop  time).  Every 
effort  has  been  made  to  make  the  program  fast  and  easy  to  use. 


Additional  documentation  of  the  facility  can  be  found  in  the  references. 

Beam  Analysis 

An  exact  solution  to  the  partial  differential  equation  for  the  beam 


2 2 2 
-^-r-y(x,t)  + (El  (pg(fc-x) 


= 

8x^ 


f (x) 


(16) 


was  unobtainable  due  to  the  presence  of  gravity.  An  approximation  scheme,  where- 
by the  mode  shapes  were  assumed  to  be  a superposition  of  beam  modes  and  hanging- 
chain  modes,  yielded  the  correct  eigenvalues  only  approximately.  A finite 
element  model  was  therefore  developed  to  obtain  normal  mode  frequendies  and  mode 
shapes.  From  Table  2,  it  is  clear  that  the  predicted  eigenvalues  are  very  close 
to  their  experimental  values. 


Variable 

Value 

Description 

1 

149.875 

in . 

Beam  Length 

P 

.6444 

Ib/ft 

Linear  Density 

El 

424.352 

Ib-ln^ 

Beam  Stiffness 

Table 

1. 

Beam  Parameters 

n 

Finite  Element 

Model 

Experimental 

0 

.308  Hz 

.34 

1 

.755 

.75 

2 

1.38 

1.37 

3 

2.21 

2.15 

4 

3.24 

3.16 

5 

4,47 

4.38 

Table 

2. 

Normal  Mode  Frequencies 
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Implementation  of  Adaptive  Estimation/Control 

A three  mode  (six  state),  plus  the  highest  frequency,  adaptive  filter/ 
controller  using  one  sensor  and  one  actuator  (at  the  lower  end  of  the  beam) 
was  developed  for  experimental  testing.  A digital  implementation  of  (7)  should 
take  into  account  the  time  delays  associated  with  discrete-time  sampling  and 
actuation,  (in  this  case,  .030  sec).  An  attempt  to  derive  a digital  analog 
of  (7)  proved  fruitless;  Instead,  the  filter  gain  Kx  was  obtained  from  a 
discrete  Kalman  filter  analysis  and  the  simple  approximation 

a. , = a.  + K (Z.  - HX.)6T  (17) 

i+1  rail 

was  made  for  the  parameter  updating.  Since  parameter  variations  are  very  slow 
compared  to  system  response  (by  assumption) , this  approximation  should  not 
present  any  problems.  In  addition,  the  control  gains  were  obtained  from  a 
discrete  optimal  control  analysis. 

The  following  plots  show  the  beam  response  to  an  impulse  at  its  lower  end, 
and  the  control  effort  exerted,  for  the  adaptive  estimator /optimal  control  and 
the  Kalman  filter/optimal  control.  In  one  case,  the  assumed  model  is  the 
correct  model,  and  the  other  case,  the  third  eigenvalue  was  intentionally 
made  20%  too  large  to  reflect  an  uncertain  model.  A discussion  of  each  plot 
is  included. 


SUMMARY  AND  CONCLUSIONS 

Although  in  theory,  the  extended  Kalman  filter  is  quite  complex  to 
Implement  in  real  time,  this  paper  has  shown  that  suitable  approximations  can 
make  Implementation  quite  tractable,  while  at  the  same  time  yielding  excellent 
results.  Of  course,  the  virtues  of  adaptive  estimators/control  must  be  weighed 
against  the  extra  computation  time  required;  however,  if  the  system  is  evolving 
in  time  or  the  model  is  inaccurate.,  then  an  adaptive  estimator /controller  is 
ideal . 
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Figure  8 . Natural  Structural  Damping 

An  Impulse  was  applied  to  the  beam  and 
the  resulting  open-loop  response 
recorded.  Note  the  light  structural 
damping  which  is  characteristic  of 
large,  flexible  space  structures.  The 
slight  displacement  offset  of  the  beam 
is  caused  by  a slight  curvature  (due 
to  hard  use) . 


Figure  9.  Kalman  Filter /Optimal 
Control 

In  this  case,  a six -state  Kalman 
f ilter/optimal  control  was  switched 
on  Immediately  following  the  impulse. 
The  resulting  closed-loop  response 
damps  out  much  more  rapidly  than  the 
open  loop  system.  Residual  oscilla- 
tions are  due  to  the  unmodeled  2.2  Hz 
mode.  The  particular  Kalman  filter/ 
optimal  control  design  has  the  best 
performance  of  any  tried  to  date. 
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Figure  10.  Adaptive  Filter /Optimal 
Control 

The  Kalman  filter  was  replaced  by  an 
adaptive  filter  which  estimates  the 
third  modal  frequency  in  addition  to 
the  state.  Since  the  initial  model 
was  exact , the  beam  response  is 
virtually  identical  to  the  Kalman 
filter /optimal  control  response. 


Figure  11.  Kalman  Filter /Optimal 

Control  with  Model  Error 

The  initial  value  for  the  third 
eigenvalue  was  set  20%  too  large. 

The  Kalman  filter /optimal  control 
design  cannot  cope  with  the  erroneous 
model  and  the  system  goes  unstable. 
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Figure  12.  Adaptive  Filter/Optimal 
Control  with  Model  Error 

Again,  the  initial  value  of  the  third 
eigenvalue  was  20%  too  large.  Unlike 
the  Kalman  f liter /optimal  control, 
which  was  unstable,  the  adaptive  filter 
effectively  damps  the  system  after  an 
Initial  identification  period. 


Figure  13.  Memory  Effect  of  the 
Adaptive  Filter 

Once  the  model  has  been  correctly 
identified,  the  closed  loop  response 
is  virtually  Identical  to  that  using 
the  correct  model  initially. 
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SOME  REMARKS  ON  THE  CURRENT  STATUS  OF  THE 
CONTROL  THEORY  OF  SINGLE  SPACE  DIMENSION 

HYPERBOLIC  SYSTEMS* 

D.L.  Russell** 

Department  of  Mathematics,  University  of  Wisconsin 
Madison,  WI  53706 


ABSTRACT 

We  review  various  aspects  of  the  control  theory  of  hyperbolic  systems, 
including  controllability,  stabilization,  control  canonical  form  theory,  etc.  To 
allow  a unified  and  not  excessively  technical  treatment,  we  restrict  attention 
to  the  case  of  a single  space  variable;  the  multi-dimensional  case  is  treated  in 
our  more  extensive  review  [ 36]  . The  paper  concludes  with  a short  discussion 
of  the  newly  developed  procedure  of  canonical  augmentation. 

SOME  ASPECTS  OF  THE  CONTROL  THEORY  OF  THE  WAVE  EQUATION 

AND  REIJVTED  SYSTEMS 

The  systematic  study  of  control  systems  governed  by  partial  differential 
equations,  a special,  but  exceptionally  important,  subcategory  of  distributed 
parameter  sy  stems  began  in  the  early  I960's  with  the  work  of  the  Soviet 
scientists  A.  G.  Butkovskii  [ 3 ] , [4],Yu.  V.  Egorov  [11]  and  others.  These 
works  were  primarily  concerned  with,  the  extension  of  Pontyagin' s Maximum 
Principle  [26]  to  certain  classes  of  processes  which  could  not  be  satisfactorily 
modelled  by  finite  dimensional  mathematical  systems.  Controllability  questions 
were  raised  but  were  usually  subsidiary  to  questions  of  optimality.  One  of  the 
first  systematic  controllability  studies,  in  connection  with  the  heat  equation, 
was  presented  by  Gal'chuk  in  [14]  . One  of  the  most  important  of  the  early 
American  contributions  to  the  subject  was  the  1963  thesis  of  Fattorini  [13]  , 
which  also  treated  parabolic  systems  and  was  one  of  the  first  works  to 
recognize  the  strong  relationship  between  distributed  parameter  control  studies 
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and  classical  results  in  analytic  function  theory. 

The  author' s own  interest  in  distributed  parameter  control  theory  arose 
out  of  consulting  experience  with  Honeywell,  Inc. , and  NASA,  starting  around 
1965  or  1966,  In  developing  the  Saturn  launch  vehicle  for  the  Apollo  program, 

NASA  has  encountered  the  problem  of  transverse  vibrations  of  the  booster 
structure  and  interaction  of  those  vibrations  with  liquid  sloshing  modes  in  the 
immense  Saturn  fuel  tanks.  While  the  eventual  treatment  of  that  problem  was 
based  on  finite  modal  approximations,  the  problem  stimulated  a great  deal  of 
research  aimed  at  an  understanding  of  the  control  of  vibrations  in  various 
distributed  parameter  settings. 

First  looking  at  this  problem,  under  Honeywell -NASA  auspices,  we 

thought  of  modelling  the  booster  structure  as  an  "Euler"  beam,  the  displacement 

w(x,  t),  which  we  may  take  to  be  scalar  here,  satisfying 

2 2 2 

P (^)  9 9 ^ 9 ) — 0 (1,  1) 

dt‘^  9x  9x 

along  with  appropriate  boundary  conditions  including  the  control  inputs,  at  the 
longitudinal  extremities  x = 0,  x = L . We  got  nowhere  with  our  study  of 
this  problem  initially  because  the  equation  (1. 1)  is  not  particularly  well 
understood  from  the  mathematical  standpoint.  There  seemed  to  be  no  "handles" 
to  grasp.  It  would  not  be  until  the  I969  thesis  of  Quinn  [ 27]  that  we  would 
understand  how  this  system  works  and  that  it  is,  in  fact,  controllable  in  a 
rather  strong  sense. 

We  knew  about  the  control  theory  of  ordinary  differential  equations  from 
various  papers  and  from  notes  and  lectures  which  would  later  be  incorporated 
into  the  1967  treatise  on  control  theory  by  Lee  and  Markus  [20]  . We  also 
knew  that  hyperbolic  partial  differential  equations  in  two  independent  variables 
reduce  to  ordinary  differential  equations  satisfied  along  the  characteristics.  It 
was  natural,  therefore,  to  look  for  hyperbolic  models  which  might  fit  our  purpose. 
Such  was  provided  by  the  Timoshenko  beam  equations 

Ip M < li  - - It  <iiw  f > = 0 (I-  2) 

2 

P (3c)  (k(x)  (-^  _ i|j ))  = 0 , (1.  3) 

9t 

which  may  be  viewed  as  two  coupled  wave  equations.  By  "wave  equations" 
here,  we  mean  the  equation 


352 


(1.4) 


r(x) 


9^z 


9t 


r " 


3 

3x 


(s  (x) 


3z 

3x 


) = 0 


All  coefficient  functions  shown  in  (1.2),  (1.3),  (1.4)  are  positive  on  O^xrSL, 

It  may  be  verified  that  (1.  2),  (1.  3)  and  (1.  4)  are  hyperbolic  in  the  sense 
described  in  [ 8]  , [25]  , e.g.  . Since  (1.4)  is  conceptually  simpler,  it  was 
studied  first,  with  accompanying  boundary  conditions 

z(0,t)  = 0 (1.5) 

||(L,t)  = u(t),  (1.6) 

the  latter  incorporating  the  control  force  u (t)  . 

While  the  practical  goal  in  mind  was  appropriate  form  of  stabilization, 
we  knew  that  in  the  case  of  finite  dimensional  systems 

X = Ax  + Bu 


an  affirmative  resolution  of  the  controllability  problem,  steering  from  a given 
x(0)  = Xq  to  a given  x(T)  = Xp  implied  the  property  of  stabilization;  hence 
we  felt  justified  in  first  looking  at  the  state  to  state  controllability  problem  for 
(1,4),  (1.5),  (1.6).  The  "energy"  form  for  (1.4)  is 

£(z,  ■;—  ) = ^ ^ r(x)  (■—  (x,t))^  + s(x)  ( ^ (x,t)  )^dx  . (1.  7) 

Given  initial  and  terminal  states 

z (x,  0 ) = Zq(x)  , (x,  0 ) = y^(x)  (1.8) 

z(x,T)=  z^(x).  If  (x,T)  = y^(x)  (1.9) 

of  finite  enercjy,  i„e.  £(2q.  Vq  ) < <»  , £(Zp  Vj^)  < >»  , we  asked  if  there 

exists  ue  L^[0,T]  for  which  the  solution  of  (1.4),  (1.5),  (1.6)  correspond- 

ing to  the  initial  state  (1.  8)  assumes  the  desired  terminal  state  (1. 9)  at  time 
t = T . The  answer,  a qualified  "yes",  came  from  two  different  approaches  to 
the  problem.  The  relationship  between  these  two  approaches  has,  over  the 
years,  grown  ever  more  fundamental  and  has  led  to  a great  many  very  interesting 
developments.  See  [34]  and  [45]  in  particular. 

The  first  method  explored  was,  as  we  have  already  indicated,  the  method 
of  characteristics.  If  we  let 

and  consider  families  X"^,  X“  of  "characteristic"  curves  satisfying 
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■W-  = 0' 


(1.  11) 


respectively,  and  then  set 

v‘''(x,t)  =~  (x,t)  + c(x)  II  (x,t  ), 

v“(x,t)  = ||  (x,t)  - c(x)  (x,t)  , 

we  see  readily  that  on  X‘^=  {(x‘*'(t),  t)}  , X"  = {(x-  (t),  t)},  respectively, 
we  have 


4 v'^(x+(t),  t)  = c'(x+(t))  -v'(K+(t),t; 

2 


v“(x- (t),  t)  = c'(x-(t))  — 


V (X. 


(1. 12) 

(1. 13) 


t = t > 2Ti 


t = T = 2T, 


x=  0 


Aj  X-(L,T) 


X = 0 


t = T < 2T, 


Aj  X-(L,T) 


X+(L,  0) 


(x,t) 

\ 


X’''(  L,  0 ) 


X = 

L t = 0 X = L 

Fig. 

1,1:  The  Method  of  Characteristics 

Because  these  differential  equations  are  satisfied  on  different  families  of 

characteristics,  the  coupling  between  them  is  more  complicated  than  for  the 

usual  system  of  ordinary  differential  equations.  Nevertheless  there  is  a method 

of  successive;  approximations,  described  in  [30],  [36],  which  enables  solution 

of  these  equations  in  certain  regions  provided  with  appropriate  boundary  data. 

Such  a region  is  the  roughly  triangular  domain  Aq  shown  in  Fig.  1. 1,  bounded 

by  t = 0,  X = 0 and  the  characteristic  X^(L,  0),  of  the  first  family 

described  by  (1. 11),  passing  through  the  point  (L,  0 ) , Together  with  the 

boundary  data  provided  by  (1.  5)  and  (1.  8),  it  may  be  seen  that  the  differential 

4- 

equations  (1.12),  (1.13)  determine  v and  v , and  hence  z(x,  t),  through- 

out the  domain  Aq  . Similarly,  these  equations  together  with  the  data 
provided  by  (1.5)  and  (1.9)  determine  z(x,  t)  in  the  domain  A^  bounded 
by  X = 0,  t = T and  the  characteristic  curve  X"(L,  T),  described  by  the 
second  equation  in  (1.11)  and  passing  through  the  point  (L,  T ) . Thus  the 

initial  and  terminal  states,  described  by  (1.8),  (1.9),  together  with  the 
boundary  condition  (1.5)  determine  z(x,  t)  in  both  Aq  and  Aj^  . 

Whether  Aq  and  A^  are  disjoint,  or  have  a region,  S2q  , of  over- 
lap , depends  on  the  time  T allotted  for  control.  The  time  required  for  the 
+ 

curve  X (0,  L)  to  pass  from  X = L to  x = 0 is 

and  this  is  also  the  time  required  for  X"(L,  T ) to  pass  from  x = 0 to  x = L. 

We  summarize  the  control  situation,  depending  on  the  relationship  between  T 
and  Tj^  . 

Case  T < 2T  . Here  Aq  and  Aj^  overlap  and  the  determinations  of 

z(x,  t)  in  the  overlap  region  ^2q=  Aq  U Aj^  provided  by  (1.8)  and  (1.9)  need 

not  and,  in  general,  will  not  agree.  There  can,  in  such  cases  of  disagreement, 
be  no  solution  of  (1.4),  equivalently  (1.12),  (1.13),  in  the  region 

Rj,  = {(x,  t)l  O^x^  L,  Orst^T}.  The  control  function  u (t),  shown  in 
(1.  6),  never  enters  the  picture  because  it  cannot  affect  the  solution  of  (1.  4)  in 
Aq  or  Aj^  if  (1.8),  (1,9)  are  satisfied  at  t = 0,  t = T,  respectively. 

Case  T = 2T  . Here  the  two  "domains  of  determinacy",  Aq  and  Ap 
just  fail  to  overlap;  their  boundaries  have  exactly  one  point  in  common,  t = Tp 
x = 0.  The  initial  and  terminal  conditions  (1,8)  and  (1,9)  determine  z(x,  t) 

in  Aq  and  Ap  respectively.  Another  process  of  integration  of  the  coupled 
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differential  equations  (1.12)  and  (1.13)  permits  unique  extension  of  z(x,  t), 
equivalently  v (x,t),  v (x,  t),  into  the  domain  n.  The  control  steering 
(1.  8)  to  (1.  9)  is  then  uniquely  determined  from  this  extension  and  (1.  6). 

The  determinations  of  z(x,  t)  in  Aq  and  A^  may  fail  to  match 

smoothly  at  the  point  p : x = 0,  t = T,  . This  results  in  discontinuities  of 

+ + - - 
V along  X ( L,  0 ) and  of  v along  X ( L,  T ) in  general. 

Case  T > 2T  . The  only  difference  between  this  case  and  the  case 
T = 2Tj^  lies  in  the  line  segment  £;  x=0,  Tj^<t<T-Tj,  which  replaces 

the  point  p of  the  case  T = 2T^  . Extension  of  z(x,  t)  from  Aq  U Aj^ 
into  n cannot  be  carried  out  until  the  boundary  condition  (1.5),  which 
yields  ^z/f)t  ( 0,  t ) = 0 , is  augmented  by  arbitrary  data 

||(0,t)  = ^(t)  , (0,t)  e i . (1.15) 

Once  this  is  done,  extension  of  z(x,  t)  into  proceeds  much  as  before. 
(See  [30],  [36]  for  details  of  the  extension  process. ) The  arbitrary 
function  ^(t)  can  be  designed  so  as  to  eliminate  discontinuities  of  the 
solution  along  X"^(L,  0)  and  X"(L,T),  to  satisfy  some  criterion  of 
optimality  ( see  [ 30]  e.  g.  ) or  to  fulfill  any  other  appropriate  design  objective. 

If  the  partial  differential  equation  (1.  4)  is  combined  with  boundary 
conditions  different  from  (1.  5),  (1.  6),  but  still  admissible  for  (1.  4),  the 

cases  T < 2T^  T > 2T^  remain  as  above.  The  rather  delicate  situation 
at  T = 2Tj  depends  on  the  specific  form  of  the  boundary  conditions.  For 
example,  the  boundary  conditions 

z(0,t)=0,  z(L,  t)  = u(t) 

lead,  in  case  T = 2T^  to  a situation  where  the  desired  control  is  not  unique; 
it  has  the  form 

u (t)  = ii(t)  + y u(t) 

where  u (t)  is  a non- zero  control  steering  the  zero  initial  state  into  the  zero 
final  state  and  y is  an  arbitrary  constant.  By  contrast,  the  boundary 
conditions 

H(0,t)  = 0,  ||(1,  t)=u(t)  (1.16) 

lead,  in  case  T = 2Tj,  to  a situation  where  the  desired  control  u(t)  does 
not,  in  general,  exist.  (See  [31],  [37]  for  more  details. ) 

The  analysis  of  more  complicated  systems  of  hyperbolic  equations,  such 
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as  the  Timoshenko  system  (1.  2),  (1.3),  is  in  general  rather  complicated  but 
there  are  some  special  cases,  including  appropriate  boundary  conditions,  for 
which  the  analysis  is  fairly  simple.  In  [M]  a discussion  is  given  permitting 
analysis  of  the  free  boundary  case 

||(0,t)  = 0,  qj(0,t)  - U (o,t)  = 0,  (1.17) 

||  (L,t)  = u^(t)  , il;(L,t)  --g  (L,t)  = U2(t)  . (1.18) 

It  may  be  shown  that  all  cross -coupling  is  of  low  order  and  the  problem  is 
essentially  equivalent  to  two  problems  (1.  4)  with  boundary  conditions  (1. 16). 
Two  critical  times  are  involved.  With  (cf.  (1.14)) 


^ ip(x)  > ’ 

m dx 

^1  - Jq  c^(x)  ' 

(1. 19) 

c fxl  = 

°2^^^  ' p(x)  ^ ' 

rp  dx 

h - c^cx)  - 

(1.  20) 

it  may  be  shown  that  finite  energy  states  are  controllable  if  and  only  if 

T > 2 max{Tp  T2}  . 

The  essential  details  of  the  analysis  are  given  in  [30]  and  are  quite  similar 
to  what  we  have  briefly  outlined  here  for  (1.  4),  (1.  5),  (1.  6)  . 

It  is  immediately  clear  that  the  method  of  characteristics  is  specially 
adapted  to  controls  u(t)  acting  at  a point,  as  in  (1.6).  This  is  true 
because  the  control  determination  occurs  at  the  very  last  stage  of  the  analysis, 
after  the  controlled  solution  has  been  computed.  If  the  control  u (t ),  itself 
scalar,  acts  on  the  system  through  a "control  distribution  function"  g (x),  as 
in  (cf.  (1.4)) 

2 

r (x)  ^ ^ ® li  ) = g (x)  u (t) , (1.  21) 

9t 

homogeneous  boundary  conditions  (cf.  (1.5),  (1.6)) 

z(0,t)  = 0,  ||(L,t)  = 0 (1.22) 

applying  at  the  boundaries,  we  face  what  appears  at  first  glance  to  be  a rather 
different  situation  than  what  obtains  in  (1.  4),  (1.  5),  (1.  6),  for  even  the 
equations  corresponding  to  (1.12),  (1.13)  will  involve  the  unknown  control  u(t) 
in  this  situation;  one  cannot  proceed  by  filling  out  z(x,  t)  in  successive 
domains  as  before;  a completely  different  approach  is  required.  Such  an 
approach  can  be  found  in  the  study  of  moment  problems  - a technique  developed 
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by  several  authors  (see  [3],  [12],  [15],  [14],  [23]).  The  technique  has 
the  advantage,  from  the  point  of  view  of  approximation  of  being  intimately 
connected  with  the  modal  representation  of  the  system  based  on  the  natural 
modes  of  vibration,  or  eigenfunctions  of  the  operator  -r(x)“\8/ax)(s(x)(9z/ax)). 

It  is  known  ( see  [ 1 ] , [ 7 ] ) that  the  operator 

Lz  = -r(x)"^  ) (1*23) 

with  boundary  conditions  conformable  with  (1,  22)  has  eigenvalues 

\ ’ k = 1,  2,  3,  . . . (1.  24) 

^1 


where  the  are  uniformly  bounded  and  is  related  to  c(x)  by  (1, 14). 

The  corresponding  eigenfunctions,  ^^(x) , k = 1,  2,  3,  • • • , form  an  ortho- 
normal basis  for  [O,  L]  (which  consists  of  the  same  functions  as 

2 ^ 

L [O,  L]  but  has  the  inner  product 


/ r(x)<?j(x)ilj(x)dx)  . 
0 


(1.  25) 


Every  finite  energy  solution  z(x,  t)  of  (1.21),  (1.22),  i.  e.  every  solution 

for  which  the  integral  (1.  7)  is  bounded  for  all  t,  can  be  expanded  in  the  form 

00 

z(x,t)  = Yj 
k=l 


where,  if  we  assume  the  control  distribution  function 

00 


g(x)  = Y 
k=l 


convergent  in 


Letting 


2 

L [0,  L]  , the  z.(t)  satisfy 
r Jc 

Zk  + ^k  ^k  ""  ^k  k=l,  2, ... 

i 

03  j,  — > k=l,  2,  3,*** 


and  using  the  transformation 


one  arrives  at  the  system 


f (x)  has  the  expression 


(1.  26) 


(1.  27) 


u(t)  . (1.  28) 
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It  may  be  seen  that  finite  energy  states  are  those  for  which 
00 

2 \ 3 < “ 

k = 1 

and  this  becomes,  in  terms  of  , 

I + Ukl^>  ^ “ • 

k=l 


Integrating  (1,  28), 

i^kT 

\(S)  - e 


CkCt)  - 


we  have. 

for  T > 

0 , 

^k 

nk(o)  = 

e u ( t ) dt 

^k  " 

iwk(T-t) 

II 

u ( t ) dt 

Assuming  the  controllability  condition 


(1.  29) 


g^  0,  k = 1,  2,  3, . . • 

we  see  that  the  problem  of  steering  between  the  given  states  at  times  0 and  T 
reduces  to  the  moment  problem 


T itOkS 


o;. 


f ( s ) ds  = 


T “iwkS 


, k = 1,  2,  3, 


/ e 
0 


f(s)ds  = , k = 1,  2,  3,  • • • , 


where  s = T-t  , f (s)  = u (T  - s),  and 


ico,  T 


-ioj  1.T 


a 


k = 2(T)^(T)-e  ^ 11^(0)),  Pk=  CktO)) 


(1.  30) 
(1.  31) 

(1.32) 


are  square  summable. 

To  solve  the  moment  problem  we  resort  to  the  theory  of  nonharmonic 
Fourier  series  as  developed  by  Paley  and  Wiener  [ 24]  , Levinson  [ 21]  , 
Schwartz  [42]  and  many  others.  (An  excellent  expository  treatment  [49]  by 
R.  Young  has  recently  appeared.)  The  following  is  known;  the  three  cases 
being  divided  in  a manner  conformable  with  the  three  cases  discussed  earlier. 

icoi-  ^ 

Case  T < 2T]^ . The  functions  e , e , k = 1,  2,  3,  • • • , are 
lineally  dependent  in  L [0,T]  in  a rather  strong  sense.  Any  one  of  these 
functions,  indeed,  any  finite  number  of  them,  lie  in  the  closed  span  of  the 
remaining  functions  (which,  in  fact,  is  equal  to  the  whole  space  L [0,T]  ). 
As  a result  the  moment  problem  (1.30),  (1.31)  cannot,  in  general,  be  solved. 
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Suppose,  e.  g. , all  but  finitely  many  of  the  a^,  say  k = K+1,  K + 2,  •••, 

were  equal  to  zero,  while  some  of  the  k = 1,  2,  • • • , K,  are  non-zero . 

The  linear  dependence  just  referred  to  shows  that  such  a problem  can  have  no 

solution;  the  equations 

«T  iw^s  T iw^s 

f e f(s)ds  = j e f(s)ds  = 0,  k > K 
0 0 

imply  that  the  same  equations  must  hold  for  k :s  K . 


Case  T = Tj.  Here  the  functions  e 
2 


"•X CO 


, k = 


form  a Riesz  basis  for  L [ 0,  2Tj^]  . Every  function  h e [ 0,  2Tj^] 
unique  convergent  expansion 


i oji.  s 


-lC0i,S 


h(s)  = 2 CV  + ] 

k=l 

and  there  are  positive  numbers  c,  C,  such  that 


-2, 


L10,2T]  ^ L^[0, 2T] 


1,  2,  3,  • . 

has  the 

(1.  33) 
(1.34) 


Further,  there  is  a unique  dual  basis  of  biorthogonal  elements  Pj^,  P ^ e 
L^[  0,  2T,  ] such  that 

l ^ITI 

1 ® k = ±1,  ±2,  ••• 

/ e p (s)ds  = 6 
0 ’ = ± 1,  ± 2,  . . • 


“ 


I*' 

I 


(1.35) 

k = £ 
k/^Jl 

which  engenders  expansions  similar  to  (1.33),  the  roles  of  c,  C in  the 
inequalities  parallelling  (1.  34)  being  reversed.  The  formal  solution  of  (1.  30), 
(1.  3l)  is  then  uniquely  given  by 


*<^)=  E -5r[“kPk(“>  + \p-k<®>i  • 


(1.3  6) 


If  we  have 

lim  1 Oi,  1 = 0 , (h  37) 

k CO  ^ 

2 

as  would  be  the  case,  e.  g. , if  g e L [ 0,  L]  , then  the  conditions  for 
convergence  of  (1.  36)  are  more  stringent  than  just  the  square  summability  of 

the  Oil. , given  by  (1.  32).  We  need 

“ 2 Pt,  2 

V (i_Lf  + l_!s.  1^)  < „ . 

k=l  ^ 
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As  a consequence  we  can  steer  (1.  21)  from  any  finite  energy  initial  state  to  a 

dense  (in  the  energy  norm)  subspace  of  final  states,  or  vice  versa,  but  we 

cannot  steer  between  arbitrary  finite  energy  states  during  [O,  2Tj^]  if  (1.  37)  is 

true.  The  case  of  boundary  control  (1.  6),  already  treated  by  the  method  of 

characteristics,  will  be  discussed  more  extensively  below.  In  that  case  the 

coefficients  g^^  in  (1.  26)  ave  bounded  and  bounded  below.  The  result  is, 

in  that  situation,  that  we  obtain  the  same  result  this  way  as  by  the  method  of 

characteristics  - given  finite  energy  initial  and  terminal  states,  there  is  a 

2 

unique  control  u e L [O,  2Tj^]  steering  the  one  to  the  other. 

Case  T > 2Ti  . The  main  difference  between  this  case  and  the  preced- 
'■  iwvS  -iu>vS 

ing  is  that  here  the  functions  e , e , k = 1,  2,  3,  • • • , form  a 

Riesz  basis  for  a proper  subspace,  E,  of  L [0,T].  The  biorthogonal 

functions  p^^, , p_^  exist,  but  are  unique  only  if  we  require  that  they  lie  in 

E — or  we  impose  some  comparable  condition.  If  we  agree  that  p^,  p ^ 

belong  to  E , then  any  elements 

'’k  = Pk  + 'ik'  p-k=p-k  + <J-k 

I 2 

with  q^^,  q,|^eE  CL[0,T]  still  form  a biorthogonal  set  relative  to  the 
iu)v®  -iwus 

e ^ , e . The  convergence  properties  of  series  involving  the  p,  , 

are  much  the  same  as  in  the  preceding  case.  As  a result  we  have  the 
same  control  capability  as  in  the  case  T = 2Tj^  but  controls  are  not  unique  . 
Indeed,  if  u is  a control  steering  between  two  given  states,  the  family  of 

rsj  /\  </\  ^ 

controls  u -fu,  u e E , all  realize  the  same  control  objective.  Again, 

this  non-uniqueness  should  be  compared  with  the  similar  property  observed  for 

T > 2Tj  in  applying  the  method  of  characteristics. 

Using  the  theory  of  distributions  and  related  material,  boundary  value 

control  situations  such  as  (1.  6)  can  be  included  in  the  same  framework  as 

2 

(1.  26)  but  with  g in  a larger  space  than  L [O,  L]  ; g should  be  a linear 
functional  (in  general  unbounded  on  L [0,  LJ  ) whose  domain  includes  the 
domain,  J5(E),  of  the  self  adjoint  operator  L,  given  by  (1.23),  with  the 
given  homogeneous  boundary  conditions.  The  g^^  are  the  values  which  g 
assumes  at  the  eigenfunctions  £ J9  (L)  . A detailed  study  of  these 

"admissible  input  elements"  is  provided  in  [17]  . In  this  way  a unification 
of  the  boundary  and  distributed  control  cases  may  be  achieved.  One  consequence 
of  this  is  that  the  biorthogonal  functions  p]^,  p_;j^  which  play  such  an 
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important  role  in  the  method  based  on  the  moment  problem  (1.  30),  (1.  3l)  can 

actually  be  obtained  through  the  more  constructive  method  of  characteristics  as 

controls  steering  from  a zero  initial  state  (say)  to  final  states  constructed 

using  a single  eigenfunction  <p^  of  L . 

We  began  our  discussion  here  with  the  Euler  beam  equation  (1. 1)  . For 

definiteness,  let  us  add  a distributed  control  term  (scalar  input)  and  specific 

boundary  conditions  so  that  we  have 

2 2 2 

p(x)-^  +-^(EI(x)-^)  = g(x)u(t)  (1.38) 


(1.  38) 


(0,t)  = 


(0,t)  = 


(1.  39) 


(L,t)  = 


(L,t)  = 0 


(1.  40) 


In  1969  J.  P.  Quinn,  in  his  doctoral  thesis  [27],  studied  the  controllability 
properties  of  a class  of  systems  including  this  one.  Here  the  operator 


Aw  = 


1 

pM 


(EI(x)^) 


on  the  domain  in  H^[  0,  L]  consisting  of  functions  obeying  boundary 

conditions  conformable  with  (1.39),  (1.40)  has  eigenfunctions  (x)  forming 

2 ^ 
an  orthonormal  basis  for  L [O,  L ] and  the  corresponding  eigenvalues 

4 p i ^ 

grow  like  k as  k -►  00  . With  ‘‘'k  “ ^k  obtain  a system  similar 
to  (1.28)  , using  a transformation  like  (1.  27)  applied  to  the  second  order 
differential  equations  resulting  from  the  eigenfunction  decomposition; 

Wk  k = 0,  1,  2,  3,  ... 

( a slight  modification  of  (1.27),  (1.28)  is  necessary  for  Xq  = 0 ; see  [34]). 
Again  there  results 


(1.41) 


and  the  energy  expression  for  (1.  38)  is,  equivalently, 
i /^p(x)  (|^)^  +EI(x)  (l!|:)^]  dx 
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i E 

k=0 


or 


' + ICkl" 


all 


» E I’ll. 

k=0 

for  "finite  energy" 


states. 


iw,  s 


“1  wj^s 


Quinn  was  able  to  show  in  this  case  that  the  functions  e , e 

iw^s  -iu)i,s 

k=  0, 1,  2,  3,  •••  (for  k = 0 replace  e , e by  1,  s ) are 

2r 

linearly  independent  in  L [0,T]  for  every  T > 0 (this  result  by  itself 
had  already  been  obtained  much  earlier  by  Ingham  [18]  who  shows,  in  effect, 
that  these  functions  form  a Riesz  basis  for  a closed  subspace  of  L [ 0,T]  for 
every  T > 0)  and,  additionally,  that  there  is  a positive  number,  M(T  ), 

p T,(s)  are 


such  that  if  the  (non-unique)  biorthogonal  functions  Pp,(s), 
appropriately  selected  in  L^[0,T]  , these  functions  are  continuous  and 
satisfy  the  pointwise  bounds 


p^(s)l  ^ M(T),  1p_^(s)1  ^ M(T),  s e [0,T]  . (1.42) 

■ “ -icoj^s 

e form  a Riesz  basis  for  a closed  subspace 


ICOkS 


The  fact  that  the 

of  L'[0,T],  T>0,  implies  that  initial  states  and  terminal  states  with 

(in  terms  of  (1.41))  expansion  coefficients  0 ' ^k  0 ^k  1’  '^k  1 

can  be  steered,  one  to  the  other,  during  [0,T]  with  ue  L^[0,t], 
provided  that 

V ^k.O  ,2  , ^k.  0 I ^ V /I  ^k.l  ^ I ^k.l  ^ 

k=0  ^ ^ k=0  ^ ^ 


The  boundedness  property  (1.42)  shows  we  can  also  control  states  for  which 


CO 

Z (1 

k=0 

3k 

. +1 

^k,  0 
^k 

1 ) < “ , 

CO 

E (1 

k=0 

1 4- 

1 ^k,l 

1 ) < “ » 

% 

1 + 

' 3, 

this  being  possible  with  a 

control  function 

u (t)  uniformly  bounded  and 

continuous  on  [0,T]  . 

We  have  noted  in  connection  with  the  Timoshenko  beam  system  (1.  2),  (1. 13) 
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(1. 17),  (1. 18),  that  an  adequate  control  theory,  based  on  the  method  of  character- 
istics, exists  when  we  have  two  separate  control  functions,  Uj^(t)  and  U2(t), 
with  which  to  control  the  lateral  deflection  and  shear  deformation  separately. 

An  open  question  is  the  adequacy  of  control  using  a single  control  input,  so 
that  (1.17),  (1.18)  becomes,  e.  g., 

^(L,t)  = Q!U(t),  iHL,t)-  (L,t)  = Pu(t) 

z z 

with  a + 3 ji^  0 . This  problem  is  a special  case  of  the  more  general 
question  of  the  controllability  of  linear  hyperbolic  systems  of  dimension  n = 2m, 
involving  m pairs  of  characteristics,  each  pair  describing  a given  wave  mode 
propagating  in  two  opposite  directions,  by  means  of  fewer  than  m control 
inputs.  Some  work  has  been  done  in  this  direction  by  R.  G.  Teglas  in  his 
thesis  [45]  and  by  N.  Wick  [47]  , but  it  is  safe  to  say  that  no  very  general 
criteria  for  this  problem  have  yet  appeared.  Particularly  valuable,  it  seems 
to  this  author,  would  be  a study  of  the  Timoshenko  beam  system  from  the 
singular  perturbation  standpoint,  elucidating  the  behavior  of  solutions  and 
controllability  properties  as  the  modulus  of  elasticity  in  shear,  k (x)  in  (1.  2), 
(1,  3),  tends  to  infinity. 

STABILIZATION,  CANONICAL  FORMS,  EIGENVALUE 
PIACEMENT,  etc. 

As  all  practicing  engineers  will  know,  controllability  in  itself  is  rarely 
the  prime  goal  of  control  system  design.  Stability,  and  related  criteria  such  as 
robustness,  insensitivity  to  particular  input  frequency  bands,  etc.,  are  more 
commonly  uppermost  in  mind.  Additionally,  there  is  the  question  of  state 
estimation  from  lower  dimensional,  noisy  observations  in  order  to  implement 
linear  feedback  control  policies.  These  subjects  have  been  pursued  almost  ad 
nauseum  for  linear,  finite  dimensional  systems.  In  the  case  of  distributed 
parameter  systems,  and  hyperbolic  systems  in  particular,  the  literature  on  this 
subject  remains  rather  sparse  and  spotty  in  its  coverage. 

As  in  the  case  of  linear  finite  dimensional  systems,  stability  and  stabili- 
zation studies  for  linear  partial  differential  equations  have  tended  to  cluster 
around  two  dominant  approaches:  the  Liapounov  approach,  primarily  carried  out 

in  connection  with  systems  involving  some  form  of  "conservation  of  energy"  law, 
and  the  spectral  approach,  determining  if,  or  making  certain  that,  the 
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eigenvalues  of  the  system  lie  in  an  appropriate  subset  of  the  left  half  plane.  The 
spectral  approach  suffers  from  the  disadvantages  of  greater  intricacy  of 
computation  and  the  need  to  show  that  the  spectrum  location  does,  in  fact, 
determine  the  asymptotic  behavior  of  the  system.  The  latter  brings  in  questions 
of  completeness  and  linear  independence  of  the  eigenvectors  of  the  system. 

We  will  begin  with  a short  discussion  of  what  has  been  done  with 
Liapounov  methods.  On  the  theoretical  side  one  can  start  with  a system 

X = Cx,  (2.1) 

C generating  a strongly  continuous  semigroup  S(t)  in  the  Hilbert  space  X 
(we  may  have  started  with  a control  system  x = Ax  + Bu,  set  u = Kx,  then 
C = A + BK ) „ We  set  up  a quadratic  functional 

V(x)  = (X,  Qx)  , 

where  Q is  a bounded,  positive,  self  adjoint  operator  on  X with  Q > ql 
for  sc'me  q > 0 , to  serve  as  a Liapounov  function.  One  may  then  show  that 

for  t2  > tj^  and  x(t)  = S(t)xQ  a "solution"  of  (2.1),  that 

(x(t2),  0x(t2))  - (x(tj),  Qx(tp) 
to 

= - r (x  (s),  Wx  (s))  ds 

t 

^1 

for  some  positive  self  adjoint  operator  W so  that,  in  some  sense  which  one 

needs  to  make  precise  in  individual  cases, 

c'"q  + QC  + W = 0 , (2.2) 

the  Liapounov  operator  equation,  is  satisfied.  An  important  result,  due  to 

Datko  [lO]  , states  that  if 
00 

f (x(t),  Qx(t))dt  < <»  (2.3) 

0 

for  every  initial  state  Xq  e X,  then  the  semigroup  S(t)  is  exponentially 
damped,  i.  e, 

llS(t)ll  ^ Me"^^  , t & 0 , 

for  positive  numbers  M,  y . The  condition  (2.3)  is  satisfied  if  W > wl 
for  some  w > 0 , as  may  easily  be  verified. 

Consider  the  linear  symmetric  hyperbolic  system  in  L^[  0,  L]  = 

(L^[0,  L]  f 

E(x)|^  = A(x)^  +B(x)w+f(x,t)  (2.4) 

where  E (x),  A(x),  B (x)  are  continuously  differentiable  mXm  matrices 
defined  for  x e [ 0,  L ] , E (x)  symmetric  and  positive  definite,  A (x) 
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symmetric.  The  wave  and  Timoshenko  equations  can  be  written  in  this  form. 

The  "energy"  usually  is  expressed  as 

L 

e(t)  = i/  (w(x,t),  E (x)  w(x,t))dx  . 

0 

With  appropriately  "conservative"  or  "dissipative"  boundary  conditions  at 
X = 0,  X = L , one  finds  that  for  t-  > t, 

Jj  ^ * • 

e(t2)  - e(tp  = f (i  f (w(x,t),  [B(x)  + B(x)"-A(x)]  w(x,t))dx 

tf  0 

+ / (w(x,  t),  f (x,  t))dx  } dt  . 

0 

If  B (x)  + B (x) - A (x)  is  uniformly  negative  definite  or  if  the  n dimensional 
control  function  f (x,  t)  may  be  arbitrarily  specified  as  a function  of  x and 
t,  one  may  use  feedback 

f (x,t)  = K(x)w(x,t)  (2.5) 

in  such  a way  that  . 

^2  L 

e(t2)  " W(x)  w(x,  t ))dx  dt  (2.6) 

tf  0 

with  W(x)  uniformly  positive  definite  and  symmetric  on  [0,L].  Then  one 
can  apply  Datko's  result,  or  more  simple  arguments,  to  show  that  solutions  of 
(2.3),  (2.4)  are  uniformly  exponentially  damped  in  Lj^[0,L]  norm. 

Note,  however,  that  if  B (x)  + B (x)  ‘ - A (x)  = 0 or  for  some  other 
reason  fails  to  be  positive  definite,  and  if 

f(x,t)  = D(x)u(x,t)  or  f (x,t)  = D (x)u  (t) 
with  dim  u (x,  t)  = r < R in  the  first  instance,  u a function  of  t only  in 
the  second  instance,  then  we  cannot,  in  general,  achieve  (2.5)  with  W(x) 
uniformly  positive  definite.  Comparable  difficulties  arise  when  boundary 
control  is  employed.  In  such  cases  it  is  a form  of  the  La  Salle  "invariance 
principle"  (see,  e.  g.  [19])  which  must  be  appealed  to,  rather  than  the  basic 
Liapounov  theory,  for  an  analysis  of  presumed  asymptotic  stability  properties  of 
the  system.  This  has  been  discussed  in  some  detail  in  [36]  and  [33]  and 
we  give  only  the  briefest  outline  here. 

The  "invariance  principle",  as  it  applies  to  finite  dimensional  systems, 
relies  heavily  on  the  compactness  of  the  " w-limit  set"  of  the  system  in  order  to 
reach  the  final  conclusion  of  asymptotic  stability.  Comparable  compactness 
properties  associated  with  the  solutions  of  an  infinite  dimensional  system  are 
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generally  difficult  to  realize  but  the  initial  attempts  to  extend  the  theory  nonethe- 
less relied  on  establishing  some  sort  of  compactness  property.  One  of  the  first 
contributions  in  this  direction  was  due  to  Dafermos  [9]  who  studied  weak 
damping  of  the  wave  equation,  relying  on  the  almost  periodic  nature  of  the 
system  solutions  to  provide  the  required  compactness,  Slemrod  [43]  studied 
the  boundary  damped  wave  equation  by  introducing  suitably  weakened  topologies 
— as  compared  with  the  usual  topology  associated  with  the  energy  norm  — and 
was  able  to  conclude  a correspondingly  weakened  form  of  asymptotic  stability. 

Knowing  that  controllability  implies  stabiliz ability  in  the  case  of  autono- 
mous finite  dimensional  linear  systems,  we  are  not  surprised  to  find  control- 
lability playing  a role  in  the  study  of  asymptotic  stability  and  stabilization 
properties  of  autonomous  infinite  dimensional  linear  systems.  This  is  discussed 
in  some  detail  in  the  paper  [28]  by  J.  P.  Quinn  and  the  author  and  also  in 
[33]  . Systems  of  the  form  (2.3),  but  with  the  control  appearing  in  the  bound- 
ary conditions,  are  studied  in  [28]  prior  to  the  main  discussion  on  the  boundary 
damped,  higher  dimensional  wave  equation.  We  can  give  an  idea  of  the  flavor 
of  the  arguments  employed  using  a simple  example  based  on  the  wave  equation 
(1,  4)  with  control  appearing  in  the  boundary  conditions  (1,  5),  (1,  6),  If  in  this 
system  one  employs  the  feedback  law 

9w 


u(t)  = -y  ~ (L,t), 


(2.7) 


the  closed  loop  system  is  (1.4),  (1.5)  together  with  the  "closed-loop"  Robin 

type  boundary  condition 


9w 

9X 


9w 


(L,t)  + Y-^(l,t)  = 0 . 

Here  a short  computation  shows  that  with  the  energy 
expression  (1.7)  we  have,  for  t2  > tj. 


(2.8) 

e (t ) defined  by  the 


- e(tp  = -yp(L)  / 


9 w , T \ 2 

g-j.  ( b,  t ) dt  . 


(2.9) 


It  is  not  feasible  to  fit  this  situation  into  the  general  pattern  based  on  the 
Liapounov  operator  equation  (2,  2)  but,  since  we  expect  (correctly)  that,  along 
with  (1.4),  (1.5),  (2.8) 

^ (L,t)  = 0 w(x,t)  = 0 , 

an  "invariance  principle"  type  of  argument  appears  to  be  in  order.  But  we 
will  use  a variation  on  this  procedure  which  makes  use  of  the  controllability 
already  established  in  Section  1.  Let  v (x,  t)  be  a controlled  solution  of 
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(1. 4),  (1,  5),  (1,  6),  u (t)  being  selected  so  as  to  steer  the  initial  state 

v(x,  0)  = w(x,  0),  (x,  0)  = -^  (x,  0),  (2.10) 

agreeing  with  the  initial  state  of  the  solution  w(x,  t)  of  (1.4),  (1.5),  (2.8), 
to  the  zero  final  state 


v(x,Tj)  = 0, 


- 0, 


(2.11) 


as  described  earlier.  Defining  the  "energy  inner  product” 

(w  ( . , t ),  V ( • , t ))  = f [p  (x)  (x, t ) (x,  t)  + p(x)  (x,  t)|~(x,  t)]  dx 

it  is  found,  using  (1.5),  (2.8),  (2.10),  (2.11),  that 

(w(.,0),  v(-^^O))  -(w(.,2Tp,  v(-,  2Tj^))  = llw(-,  0)H|  = 

= -P(L)  / ^[|f  (L,t)|f(L,t)  +|j(L,t)-^(L,t)]  dt 


= p(L)^  1 [Y|~(L,t)  +-g(L,t)]dt 

2T 

= P(L)/  ^ -^(L,t)[Y  If  (L,t)  +u(t)]dt. 

2 

Here  the  energy  norm  at  t=  0,  is  2e(0)  . Applying  the 

o 

Schwartz  inequality 

?T 

n ^ 1 r\ o 


4e(0)‘^  ^ P(L)/ 


’1  8w 


(L,t)‘'  dt 


(2. 12) 


(Y-^(L,t)  +u(t))'"dt. 

A slightly  more  detailed  study  of  the  control  problem  for  (1.  4),  (1.  5),  (1.  6)  in 

the  case  T = 2T,  (or  T > 2T. ) shows  that  control  from  an  initial  state 
w(x,  0),  (x,  0)  to  0,0  at  time  2Tj^  is  realized  with  a control  u(t) 

which  satisfies 

/ u(t)^dt  ^ K.e(O) 

0 ^ 

and,  for  the  resulting  controlled  solution  we  have 

2T 

/ 1 If  (L,t)^dt  ^ K^e(O) 

for  certain  positive  constants  K„  and  Kn  . Then  (2. 12)  easily  yields 
2T  u J. 

Yp(L) / (L,t)^dt  > = Kg(0) 

0 2(K(j+Y‘^K^)e(0) 
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(2.13) 


and,  setting  t^  = 0 , = 2Tj^  in  (2.9),  we  have 

e(2Tj)  < £(0)  - Ke(0)  = (I-K)e(o) . 

Since  e(2T|^)  is,  from  (2.9),  (2.13),  positive  and  less  than  or  equal  to  £(0) 

we  conclude  0 < 1 - K < 1 . 

Repeating  the  above  argument  on  successive  intervals  [O,  2T^]  , [ 2T^ 
4T^]  [2kTp  2(k+l)Tj^]  , . , . and  using  the  monotonicity  of  g(t),  as 
implied  by  (2.9),  we  conclude  that  g(t)  decays  exponentially  to  0 as  t-^<». 
The  same  general  argument  can  be  used  with  a fairly  wide  class  of  boundary 
damped  linear  symmetric  hyperbolic  systems  (2.  4)  and  with  many  other  systems 
which  are  energy  conserving  in  the  uncontrolled  situation  and  suitably  strong 
controllability  properties.  The  Timoshenko  system  (1.  2),  (1,  3),  v/ith  appropriate 
boundary  conditions,  is  in  this  class.  As  far  as  the  author  is  aware,  the  Euler 
beam  model  (1. 1)  has  not  yet  been  studied  from  this  point  of  view. 

The  spectral  approach,  as  we  have  already  indicated,  involves  a direct 
analysis  of  the  eigenvalues  and  eigenfunctions  or,  more  generally,  the  spectrum 

and  invariant  subspaces,  of  the  generating  operator  C for  a given  system 

• • 

X = Cx,  possibly  derived  from  a control  system  x = Ax  + Bu  by  the  use  of 
linear  feedback  u = Kx  so  that  C = A 4 BK  . A fairly  common  case,  which 
can  be  treated  with  minimal  difficulty,  arises  when  all  but  finitely  many  of  the 
eigenvalues  of  C have  negative  real  parts.  Under  generically  valid  control- 
lability-type conditions  it  is  then  possible  to  move  the  unstable  eigenvalues 
into  the  left  half  plane  while  either  keeping  the  stable  eigenvalues  fixed  or  else 
maintaining  a certain  margin  of  stability.  Work  of  this  sort  has  been  carried  out 
by  Tri.ggiani  [46],  Sakawa  [40]  , [41]  and  others. 

A somewhat  more  challenging  task  arises  when  one  starts  with  a system 
having  infinitely  many  eigenvalues  in  the  closed  right  half  plane  (usually  one 
considers  a conservative  system  wherein  all  of  the  eigenvalues  of  C are  purely 
imaginary)  and  one  attempts  to  devise  a feedback  law  to  move  all  of  these 
eigenvalues  over  into  the  open  left  half  plane.  A number  of  procedures  have  been 
examined  in  this  connection. 

In  [32]  a second  order  system  with  scalar  control 

X + Ax  = bu,  X,  b e X , (2.14) 

is  studied,  X being  a real  Hilbert  space  and  A an  unbounded  positive  self 
adjoint  operator  on  X . Assuming  that  A has  a Riesz  basis  of  eigenvectors 
k = 1,  2,  3, . . . , in  X,  and  corresponding  positive  eigenvalues  , 
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(2. 15) 


increasing  with  k , k = 1,  2,  3,  • . • , x and  b may  be  expanded  as 

00  00 

X = 2 *=  = S 

k=l  k=l 

convergent  in  X , with  square  summable  coefficients.  We  assume  the 
minimal  condition  for  approximate  controllability 


\ 0 , k = 1,  2,  3,  • • • . 

• • 

The  energy  form  is  ^ [ (x, x)  + (x,Ax)]  = g and  elementary  computations 
show  that  for  (2. 14)  and  for  any  T > 0 


T 

e(T)-g{0)=  f (x(t),b)u(t)dt  . (2.16) 

0 


It  follows  that  with 

u(t)  = -Y(x(t),  b)  (2.17) 

the  energy  £(t)  is  non- increasing  with  increasing  t . So  far  this  is 
basically  a Liapounov  approach  employing  what  is  known  in  the  engineering 
literature  as  an  IIAF  (Identical  Location  of  Accelerometer  and  Forces) 
approach.  The  resulting  closed  loop  system  is,  still  in  second  order  form, 

X + Bx  + Ax  = 0 (2. 18) 


with  B defined  by 


With 


Bx  = Y (x,  b)b  . (2.19) 

y = X , one  may  consider  the  equivalent  first  order  system  in  XX  X, 


C = 


(2.  20) 


and  ask:  what  are  the  eigenvalues  and  eigenvectors  of  C?  It  is  here  that 

one  leaves  the  second  method  of  Liapounov  and  returns  to  his  first.  In  [32] 
a perturbation  analysis  is  carried  out,  valid  for  small  values  of  y in  (2. 17), 
(2.19).  It  is  shown  that,  under  the  separation  assumption 


^k+1  - ^ d > 0,  ~ , (2.  21) 

the  eigenvalues  of  C,  which  for  y = 0 nre  k = 1, 2, 3,***,  all 

have  negative  real  parts  for  y ^ 0 and,  moreover,  designating  the  perturbed 
eigenvalues  by  k = ± 1,  ± 2,  ±3,*--,  ^^^(0)  = iw^,  "^^k' 

we  have  (cf.  (2. 21)) 

^j^(Y)  = iwj^  - Ibj^  1^  + $(Y^~^  ),  k » (2.22) 

k 
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? v(v)  = -iw,  - |lb  1^+  ),  k-«.  (2.22) 

-k  k 2 k j^^jZ 

It  is  also  possible  to  show  that  the  perturbed  eigenvectors  continue  to  form  a 
Riesz  basis  for  the  space  XXX.  From  this  it  follows  that  all  solutions  of 
(2. 18)  tend  strongly  to  zero  in  the  energy  norm,  though  not  at  a uniform  expon- 
ential rate. 

Following  Wonham*  s initial  results  [48]  on  the  finite  dimensional  case, 
there  has  been  considerable  interest  displayed  in  the  question  of  spectral 
determination  via  linear  feedback  for  distributed  parameter  systems.  In  terms 
of  the  system  (2. 14),  equivalently. 


X 

y 


u. 


(2.23) 


with  initial  (u  = 0)  eigenvalues  ± iw^^,  k = 1,  2,  3,  • • • , the  question  may 
be  phrased  as  follows:  we  suppose  use  of  a linear  feedback  functional 


1 

u = (A^x,  kj^)  + (y, k2) , e X,  (2.24) 

i i 

bounded  relative  to  the  energy  norm  (x,Ax)  + (y,  y)  = (A^x,  A^x)  4 (y,  y)  in 
XXX.  With 

Kj^x  = (A^x,  kpb,  K2y  = (y,k2)b  (2.25) 

the  closed  loop  system  is 


One  can  now  ask:  What  eigenvalues  can  be  achieved  for  the  closed  loop 

system  (2.24)  by  appropriate  selection  of  k2  in  (2.25)?  For  some 
time  the  author  was  under  the  Impression  that  his  approach  via  canonical  forms 
[35]  (more  on  this  below)  was  the  first  treatment  of  this  question  but,  in 
fact,  it  appears  that  this  credit  must  go  to  Prof.  Sun  S.  -H.  of  Szechuan 
University  who  treated  this  problem  by  a more  sophisticated  application  of  the 
perturbation  technique  used  by  the  author  in  [32]  to  obtain  the  result . 

Sun  v/as  able  to  show,  with  an  assumption  similar  to  (2.  21)  and  the  Riesz 
basis  assumption  on  the  open  loop  eigenvectors,  that  the  totality  of  spectra, 
achievable  by  use  of  (2.25)  coincides  with  sequences  ^ k = 

1,  2,  3,  • « • for  which,  assuming  the  hy.  0 as  before. 
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Z (1 


k=l 


2 

1 + 


-k  ■'i^k 


) < 


His  very  Important  paper  has  been  translated  by  Ho  L.  -F.  in  [44]  . Some 
comparable,  but  necessarily  weaker,  results  have  been  obtained  by  Reid  in  his 
thesis  [29]  for  the  equation  of  linear  surface  waves  where  (2.  21)  is  not 
satisfied  and,  in  fact,  lim  = 0 • Other  results  in  this  | 

k -*•  09  I 

direction,  for  hyperbolic  systems  of  various  types,  have  been  obtained  by  Clark j 
[5],  [6]  and  by  Ho  in  his  thesis  [16],  j 

Much  of  the  initial  impetus  for  the  study  of  control  canonical  forms,  I 
both  for  finite  and  infinite  dimensional  systems,  came  from  the  spectral 
determination  question  discussed  above,  but  the  subject  is  interesting  in  its 
own  right  and  shows  some  promise  of  being  adaptable  for  "real  world"  control 
implementation.  The  reader  will  recall  that  a finite  dimensional  controllable 
system 

X = Ax  + bu,  X e , 

with  scalar  control  u is  equivalent,  via  a state  space  similarity  transform-  j 
ation  (see  [20],  [35])  to  a system  in  rational  canonical  form  corresponding 
to  the  n-th  order  scalar  equation 

y^^^  + + • • • + a^_jy'  + a^y  = u,  (2.  2?) 

where 

p(\)  = det(M-A)  = a^X^"^  + •••  a^_^X  + a^ 

is  the  characteristic  polynomial  of  the  matrix  A . Comparable,  but  somewhat 
more  intricate,  results  are  available  for  systems  with  higher  control  dimension 
[20]  , [2]  . In  [38]  we  note  that  if  one  employs  a scalar  linear  observation 

y = h""x  = ( x,  h ),  (2.  28) 

there  is  exactly  one  observation  vector  h e R'^  for  which  (2.  28)  satisfies 
(2.  27);  for  general  h the  right  hand  side  will  involve  the  derivatives  of  u 
of  order  u - 1 . Systems  (2.  27)  are  particularly  easy  to  deal  with. 

Closed  loop  eigenvalues  ' ’ * » Cn  realized  simply  by  forming 

the  polynomial 

<3  ( ^)  = 1 t ( ^ “ Ck^  “ + Cj^  ^ + . . . + + c^ 

k — 1 


and  determining  u by  linear  feedback  on  the  observation  y and  its 
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derivative  s , 


u = i • 

k = l 

Apparently  less  well  known,  but  quite  obvious,  is  that  the  control  problem  for 
(2.  27)  is,  in  a sense,  trivial.  Let  us  suppose  the  initial  instant  is  taken  to 
be  t = 0 and  control  is  to  be  effected  during  0 ^ t T . Let  the  initial 

state  be  specified  by 

k=l,  2,  ...,n  (2.29) 

and  the  terminal  state  by 

y(n-k)  ^ , k = 1,  2,  • . • , n . (2.  30) 

If  y(t)  satisfies  (2.29)  and 


y'  ^(t)  = v(t),  0 < t s T 

then  we  see  readily  that  for  k = 1,  2,  • • • , n 

k V — ^ t* 

y («  = A y„-t+i  Tkrr)T  + J (t-s)  v(s>ds 

£ = 1 ^ 


and  (2.  30)  is  achieved  just  in  case 

T , „ k mk-£ 

/ (t-s)  ■ v(s)ds  = 

^ £ = 1 
k = 1,  2,  • • • , n . 


(k-£)l  ' 


This  is  easily  solved  for  v in  various  function  classes,  e.g.  polynomials  of 
degree  ^ n - 1,  etc.  and,  it  should  be  noted,  the  solution  has  nothing  to  do 
with  the  coefficients  in  (2.  27)  so  the  calculation  can  be  carried  out  once  for 
any  given  T and  recorded  for  use  ever  after.  Then  in  a given  canonical 
system  (2.27)  we  need  only  set 

n 

u(t)  = v(t)-  Yj  (t)  (2.32) 

k=l 

to  realize  the  desired  control  objective. 

Since,  in  a given  control  context,  it  is  not  likely  that  the  available 
obse:rvation  (2.  28)  will  be  the  particular  one  for  which  (2.  27)  obtains,  the 
above  result  might  seems  to  be  a generally  useless  curiosity.  It  turns  out, 
however,  that  in  canonical  form  theory  there  is  a counterpart  to  the  more  widely 
known  observer  theory.  If  C is  any  nX  n matrix  whose  minimal  and 
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characteristic  polynomials  coincide,  it  is  possible  to  select  (non -uniquely)  r, 
d and  j such  that  the  augmented  system 
X = Ax  + bu 


z = ry  + Cz  + du  ( = rh^'x  + Cz  + du  since  y = h"‘x ) 
with  augmented  observation 


w=y  + j z = hx+j  z 

is  in  canonical  form,  so  that  for  some  coefficients  oiy  ci^, 


(2.  33) 
(2.  34) 

(2.35) 


O' 


2m 


(2n)  , (2n-l)  , 

W + + 


The  adjoined  system  (2.  34)  can  be  realized  electronically,  just  as  an  observer 
system  is,  and  the  considerable  freedom  in  choice  of  C , r,  d and  j 
provides  much  design  flexibility.  In  some  cases  the  dimension  of  (2.  34)  can 
be  reduced.  The  proof  that  (2.33),  (2.  34),  (2.  35)  can  be  made  a canonical 

system  appears  in  [38]  . 

A parallel  control  canonical  form  theory  has  been  developed  for  certain 
hyperbolic  distributed  parameter  systems,  involving  neutral  functional  equations 
in  place  of  the  n-th  order  scalar  equation  (2.  27).  The  theory  is  quite  comple?^, 
especially  as  it  applies  to  partial  differential  equations  with  variable  ! 

coefficients  (see  [ 35  ] , [16]  , [38]  , [ 39  ] e.g.).  To  give  an  idea  how  the  i 

theory  is  developed  we  will  consider  the  constant  coefficient  case  of  (1.  4)  | 

which,  without  loss  of  generality,  we  can  take  to  be 


a^w 


a^w 


at  ax 

w(0,t)  = 0 , 


= 0,  t>0,  0:^x^l, 


(l,t)  = u(t)  . 


(2.  36) 

(2.37)  i 


The  normalized  eigenfunctions  of  the  corresponding  homogeneous  system  are 


(x)  = sin 


2k- 1 


k 
2k -1 


TTX  , 


k = 1,  2,  3, 


Setting  w^^  = — ^ 


IT 


and  forming  the  expansions 


w 


(x,t)  = 2 Wj^(t)  iZ^]^(x)  , 
k=l 


k=l 

followed  by  the  transformation 


(2.38)  I 

(2.  39) 
(2.  40) 
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(2.  41) 


(2.  43) 


we  have,  for  k = 1,  2,  3,  • • • 

\ ?k  = -‘*'kCk  + 

Consider  now  the  neutral  delay  equation 

y(t  + 2)  + y(t)  = u(t+2)  . {2.41 

The  characteristic  function  of  the  homogeneous  equation  is 

p (X.)  = +1  = 2e  ^ cosh  k 

and  the  zeros  of  p(k)  are  precisely  the  eigenvalues  ± appearing  in 

(2.  42)  . The  transfer  function  for  (2.  43)  is 

= I -flT  + I (2-4^ 

e +1 


which  can  be  rewritten  as 


V’')=  Z -2^ 


. 1 „ 2k-l 

2 ’ “k  “ 2 


If  we  define  an  observation  y (t)  on  (2.  42)  by 


(2.  44) 


(2.  44) 


y(t)=  Yj  \ +2u(t) 

k=  1 

the  transfer  function  for  y is,  formally, 

05  , , i»k-l  _i_  I ^ «k-l  _1_ 

y nIZ  ^ gk<-*>  nTT 

^ \ • \ t • 


k = 1 


X - it 


X + it 


which  may  be  seen  to  agree  with  (2.  44)  just  in  case 

Using  (2.  38),  (2.  39),  (2.  41),  (2.  42)  it  may  be  seen  that  this  choice  of  h^, 

corresponds  to  ! 


(2.45)  I 


This  observation  on  (2,  42),  and  no  other,  satisfies  the  scalar  equation  (2.  43) 
which  serves  as  the  control  canonical  form  for  (2.42).  The  details  of  the  above 
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calculations  and  some  idea  of  the  form  of  a general  theory  appear  in  [38]  and  | 
[39].  i 

I 

If  the  canonical  observation  (2.  45)  were  actually  available,  so  that 

we  have  (2.  43),  its  usefulness  is  quite  clear.  For,  with  the  causal  feedback 

law  2 

u(t  + 2)  = (1-  -Y)y(t)  - f c(s)y(t+s)ds  (2.46)  i 

0 

(2.  43)  transforms  to 

2 

y (t +2)  + Yy(t)  + / c(s)y(t  + s)ds  = 0 (2.47) 

0 t ^ , t 

and  it  is  known  from  [35],  [44]  that  the  exponential  solution  e , e “ i 

of  (2.  47)  can  be  made  such  that 

?k  = ‘“k  + “ + =k>  ?-k  = ‘“k + “ + ®-k- 

where  a is  a complex  number  (ordinarily  negative)  determined  by  y 
^k’  ^ k arbitrary  complex  numbers,  determined  by  c e L [O,  2]  , such 

that  00 

E (1  ^ + 1 ^ * ' 

k=  1 


It  may  be  shown  that  these  are  the  eigenvalues  of  the  closed  loop  system  (2.36), 
(2.37),  (2.45),  (2.46). 

In  a given  application,  however,  it  is  entirely  likely  that  the  particular 
"canonical"  observation  (2.46)  will  not  be  available.  Indeed,  in  the  example 
indicated,  since  this  observation  is  taken  at  the  same  point  where  control  is 
applied  and  might,  therefore,  be  subject  to  a certain  amount  of  noise 
disturbance,  it  might  not  be  desirable  to  use  this  observation  in  practice.  To 
illustrate  the  use  of  the  technique  of  canonical  augmentation  (or  "canonical 
compensation",  perhaps)  let  us  consider  the  same  system  (2.36),  (2.37), 
but  suppose  the  available  observation  is 

y(t)  = -—  (0,t)  . (2.48) 

It  is  not  hard  to  show  in  this  case  that  y (t)  satisfies  i 


y(t  + 2)  + y(t)  = u(t  + l) 


(2.  49) 


rather  than  (2.43).  This  "central"  control  canonical  form  is  not  as  usable  as 

the  "backward"  form  (2.43)  because,  unlike  (2.46), 

.2 

u (t  + 1)  = (l-y)y  (t)  - / c(s)y(t  + s)ds 

0 
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is  not  a causal  feedback  law  and  cannot  be  implemented.  But  now  couple  (2.49) 
with 

z(t+2)+  pz(t)  = au(t+2)+bu(t+l) 

+ cy  (t + 1 ) + dy  (t ) (2.50) 

and  let 


w(t)  = y (t)  + z (t)  . 


One  ordinarily  will  take  1 p 1 < 1 so  that  the  homogeneous  part  of  (2.  50)  is 
asymptotically  stable,  thus  avoiding  the  growth  of  parasitic  solutions  in  the 
compensator.  Since 

[y(t+4)  + y(t+2)  - u(t+3)]  + p[y(t+2)  + y(t)  - u(t+l)]  = 0 

while 


[ z (t  +4)  + pz  (t  + 2)  - au  (t  +4)  - bu  (t + 3)  - cu(t  +1)  - cy(t  +3)  - dy  (t  +2)  ] 

+ [ z(t  +2)  + pz  (t)  - au  (t  + 2)  - bu  (t  +1)  - cy  (t  +1)  - dy  (t)  ] =0 
we  find  thal: 

w (t  + 4)  + (1+  p)  w (t  +2)  + pw  (t)  = au  (t  +4)  + [ 1 +b]  u (t  + 3) 

+ au(t+2)  + [p  + b]  u(t+l)  + c[y(t+3)  + y(t+l)] 

+ d[y  (t  + 2)  +y  (t)]  = (using  (2.49)) 
au(t  + 4)+[l+b]  u(t+3)+[a+c]  u(t  + 2)+[p+b+d]u(t+l). 
Then  it  is  easy  to  see  that  with 

a = l,  l + b=  a+  c=  p+  b+  d=  0, 

i.e.  with 

a = 1,  b = -1,  c = -1,  d = 1 - p , 
we  arrive  at  the  "backward"  canonical  form  satisfied  by  w(t)  : 

w(t+4)  +(l  + p)w(t+2)  + pw(t)  = u(t  + 4) 

for  which  causal  feedback  laws 


u (t  + 4)  = -y^w  (t  +3)  + [1  + p -Yo]  V/  (t+2)  - Yo  w (t + 1) 

U (2. 51) 

+ [P“Y4l  w(t)  - j c(s)w(t  + s)ds 

0 

may  be  implemented,  yielding  overall  closed  loop  systems 
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w(t  + 4)  +Y,w(t+3)  +Y„w(t  + 2)  +Y'aW(t+l)  +YA^(t)  i 

4 ^ ^ (2.52)  ; 

+ 1 c (s)  w(t  +s)  ds  = 0 . 

0 

It  is  necessaiy  to  check  separately  that  the  system  (2.49),  (2.50),  (2.51)  is  i 
observable  in  any  given  case. 

The  exponential  solutions  of  (2.  52),  and  hence  the  eigenvalues  of  ! 
(2.36),  (2.37),  (2.48),  (2.50),  (2.51)  may  be  determined  with  the  same 
flexibility  as  already  noted  for  (2.47).  This  is  discussed  in  some  detail  in  the! 
thesis  of  R.  G.  Teglas  [45]  . A complete  theory  of  canonical  compensation  | 
for  hyperbolic  systems  remains  to  be  developed  but,  we  hope,  the  example 

i 

given  here  gives  reason  to  believe  that  the  method  is  a promising  one.  It  is  : 
clear  that  there  are  some  connections  with  observer  theory  as  developed  in  ' 
[22]  and  elsewhere;  these  connections  remain  to  be  worked  out. 
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1.  Introduction 


Several  recent  papers  by  the  present 
author  have  applied  the  modern  theory  o£ 
the  infinite  dimensional  regulator 
problem  to  problems  in  active  control  of 
flexible  structures  ([2],  [3],  [4],  [5], 
[6]). 


IVe  study  the  sense  in  which  the 
feedback  control  law  based  on  an  approxi- 
mate finite  dimensional  model  of  a con- 
tinuous structure  approximates  a control 
law  which  is  optimal  for  the  distributed, 
or  infinite  dimensional,  model  of  the 
structure.  From  the  analysis  of  the 
various  control  and  stability  issues 
associated  with  this  basis  question,  we 
can  gain  useful  information  for  design- 
ing finite  dimensional  compensators 
which  produce  near-optimal  performance 
in  infinite  dimensional  systems. 

Our  most  important  analytical  tools 
are  the  properties  of  the  infinite 
dimensional  Riccati  equation  and  ap- 
proximation theory  for  its  approximate 
solution  by  means  of  finite  dimensional 
Riccati  matrix  equations.  In  this  paper, 
we  hope  to  indicate  some  of  the  impor- 
tant predictions  that  can  be  made  about 
large-order  finite  dimensional  control 
laws,  using  the  theory  of  infinite 
dimensional  Riccati  equations. 

2 . The  Infinite  Dimensional  Problem 

We  consider  an  abstract  second  order 
(in  time)  linear  system 

(1)  x+Cx  + A x = 8u, 
o o 0 

where  the  state  vector  x(t)  is  in  a real 


Hilbert  space  H and  the  control  vector 
u(t)  is  a real  m-vector  for  some  finite 
m.  The  linear  operator  is  a self- 
adjoint  operator  in  H with  compact  re- 
solvent, C = C*  >0  and  C is  A - 
o o o o 

bounded,  and  8 is  bounded.  With 
y = (x,x),  (l)^takes  the  first  order 
form 


(2)  y = Ay  + 8u  . 

See  [2]  for  details.  The  state  vector 
y(t)  is  cast  in  a Hilbert  space  E such 
1 2 

that  -jll  y(t)||g  is  essentially  the 
total  energy  in  the  system. 


In  the  infinite  dimensional  opti- 
mal control  problem,  we  choose  u to 
minimize 


(3) 


u 


2 ^ , 
)dt. 


where  V - V*  > 0 and  V is  bounded. 


If  J can  be  made  finite  for  each 
y(o),  then  an  optimal  control  exists 
for  each  y(o)  and  has  the  feedback  form 

(4)  u(t)  = -8*Py(t), 

where  P = P’*  is  nonnegative  and  bounded 
and  satisfies  the  infinite  dimensional 
Riccati  equation 

(5)  A*?  + PA-P88*P  + P = 0. 

See  [1],  [2]. 

When  P exists  for  V positive  definite, 
then  the  closed- loop  semigroup  generated 
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by  A-  BB*?  is  uniformly  exponentially 
stable. 

3.  Approximation 


approximates  the  semigroup 

generated  by  A-B8*P  in  norm,  and  hence 
is  uniformly  exponentially  stable  for  n 
sufficiently  large. 


To  approximate  (4)  and  (5),  we 
define  Ej^  to  be  the  subspace  of  E 
spanned  by  the  first  n eigenvectors  of 
A , and  project  the  infinite  dimensional 
problem  onto  to  obtain  the  nth  finite 
dimensional  problem.  Again,  see  [2] 
for  details. 

The  eigenvectors  of  A^  are  the 

natural  mode  shapes  of  free  vibration, 
so  that  we  are  using  modal  approximation 


Projecting  the  operators  A,  B,  and 

V onto  E yield  the  corresponding 

operators  A , B , and  D . For  the  nth 
^ n n n 

problem,  we  then  obtain  the  Riccati 
equation 


(6) 


A*P 


n n 


+ P A 
n n 


P B B*P  + D = 0. 
n n n n n 


4 . Exampl e 

We  take  a simply- supported  beam 
whose  first  natural  frequency  is  10  hz.; 
i.e.,  the  lowest  eigenvalue  of  the 
operator  A^  in  (1)  is  100.  The  control 

force  is  distributed  over  a pad  whose 
length  is  five  per  cent  of  the  beam's 
length.  For  damping,  we  assume 

C = c A^.  which  yield  the  same  damp- 

ing  ratio  for  each  mode.  V/e  denote  by 
x(t,Ti)  the  displacement  of  a point  a 
distance  q from  the  left  end  at  time  t. 


In  the  optimal  control  problem,  we 

take 


(7)  J 


(2E(t)  + u^(t))dt, 

•'0 


This  equation  amounts  to  an  2nx2n 
Riccati  matrix  equation  because  the 
operators  involved  are  all  finite  di- 
mensional. Our  analysis  hinges  on  when 
and  in  what  sense  P approximate  P if  P 
exists. 

From  here  on,  we  will  assume  that 
A is  positive  definite  and  that 
1 2 

■^11  llg  equal  to  the  total  energy 

in  the  structure.  According  to  [2],  if 
V is  positive  definite  (coercive)  and 


where  E(t)  is  the  total  energy  in  the 
beam,  l^en  a solution  to  (3)  exists, 
(4)  becomes 


(8)  u(t)  = 


f" (q) x" (t ,q)dn 


0 


0 (no  damping  in  the  free  structure) 


then  P does  not  exist  and 


n ' 


increases  without  bound  as  n increases. 
On  the  other  hand,  if  is  positive 

definite,  then  P exists  and  P A 

n n 

converges  strongly  to  P,  where  A is  the 
projection  operator  mapping  E onto  E^. 

Also  when  is  positive  definite,  the 
closed-loop  semigroup  generated  by 


g(n)x(t,n)dn, 

0 

.2 

where  (•)''  = — o • (Details  are  in  [7]..) 
9n 

From  the  nth  approximate  problem 
we  obtain  approximations  f^  and  g^  to 

f and  g by  using  the  solution  to  the 
Riccati  equation  (6) . We  should  note 
that  f^  and  g^  give  graphical  repre- 
sentations of  the  modal  control  law 

(9)  u = -B*  P y (t) 
n n n n 

that  would  be  used  in  a finite  dimen- 
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sional  compensator  based  on  an  n-mode  7.  J.  S.  Gibson,  "Approximation  Theory 
model  of  the  beam  (see  [2],  [7]).  for  Linear- quadratic  Optimal 

Control  and  Estimation  of  Hyper- 

Figures  1,  2,  and  3 show  our  numer-  bolic  Systems,"  to  appear. 

ical  results  for  f and  g with  increas- 
n n 

ing  n.  Note  the  important  role  of 
inherent  system  damping  when  large 
numbers  of  modes  are  modeled.  Perhaps 
our  most  significant  results  pertain  to 
the  importance  of  modeling  damping  in 
large  numbers  of  modes. 
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ABSTRACT 


Vie  consider  the  dynamic  response  of  beams  and  plates  to  loads  which  are 
extreme  within  a certain  class  of  admissible  loads.  Two  approaches  to  this 
problem  are  suggested.  In  approach  one,  Pontrjagin's  maximal ity  principle  is 
regarded  as  an  additional  constraint.  The  optimality  of  design  is  then 
determined  by  standard  numerical  techniques. 

Secondly,  we  proceed  with  the  "adjoint  variable  approach"  to  sensitivity 
of  structural  design  for  a given  inhomogeneous  term.  The  inhomogeneous  term 
is  an  extremal  element  of  admissible  load  vectors,  which  constitute  a closed 
subspace  of  a Sobolev  space.  Again,  we  invoke  the  maximal ity  principle.  While 
only  beam  and  plate  theory  problems  are  used  as  examples,  generalizations  are 
easy  to  perceive. 


0.  NOTATION  AND  SOME  BASIC  ASSUMPTIONS 


Symbols  used  in  the  discussion  of  beam  theory  have  the  following  meaning; 

p denotes  the  material  density  per  unit  length. 

I:  denotes  Young's  modulus,  which  is  assumes  to  be  constant  (E  > 0)  in  our 
d i scuss ion . 

A is  the  cross-sectional  area  (A(x)  > 0, 

I is  the  moment  of  inertia  of  the  cross-sectional  area  about  the  neutral 
axis  of  bending,  i.e.,  a line  whose  length  remains  unchanged  locally  as  the 
beam  undergoes  deformation.  I is  not  assumed  to  be  constant,  but  is  a positive 
function. 

t denotes  time. 

The  Cartesian  coordinate  system  is  used.  The  x-axis  coincides  with  the 
underformed  axis  of  the  beam,  which  occupies  the  interval  Q = [-J!-,  +&]  . W(x,t) 
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denotes  the  displacement  function  which  is  an  element  of  Sobolev  space  H 
determined  by  the  existence  of  the  following  integrals: 


>dx 


P A(x) 


dx, 


for  all  values  of  t in  a given  time  interval  [0,T].  The  inner  product  in  H is 
given  by 


(0.1) 


'•+£ 


L 


E l(x) 


^9^W^(x,t) 


9x 


9x 


dx 


P A(x)| 


^9Wj(x,t)  9W2(x,t) 


9t 


9t 


dx. 


This  is  distinguished  from  the  usual  L2  (inner)  product 


(f ,g)  = / [f (x,t)  g(x,t) ]dx. 


The  total  energy  of  the  beam  is  identified  with  the  energy  norm 


(0.2) 


W(x,t) 


e(t)  , 
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which  differs  from  l2(^^)  norm 


= (W,W) 


1/2 


We  assume  that  A(x)  and  I (x) 


depend  uniquely  on  the  choice  of  m-tuple  of  design  parameters 


(x)  , S^(x)  , S^(x)|-  = S; 


that  is, 


and 


A = A^S(x)  ;xj 


I = 1 |s(x) ;xj. 


The  vector  S(x)  consists  of  m-design  functions,  such  as  thickness  of  the  web, 
width  of  the  flanges,  etc.,  which  are  to  be  determined  an  optimal,  or  near- 
optimal  design.  The  functions  Sj(x),  i = 1,  2,  ...,  m,  -5.  ^ x ^ +Z,  obey 
certain  constraints,  such  that  0 < Gj  ^ Sj(x)  ^ Ej,  where  G;,  E|  are  some 
constants  determined  by  manufacturing  limitations.  The  set  ,9^  of  designs 
obeying  these  constraints  is  called  admissible.  The  set  of  admissible  loads 
Includes  all  bounded  (Lebesgue)  measurable  functions  in  fi,  and  linear 
combinations  of  the  spatial  Dirac  delta  function  and  its  first  derivative 
concentrated  at  finite  number  of  points  in  Q,  multiplied  by  an  L2  (0,T) 
function  of  time;  that  is,  an  admissible  load  q(x,t)  is  of  the  form: 


(0.3)  q(x,t)  = g(x,t)  +^^a.6(x  - 5j)  <t>|(t)  bj6"(x  - li^j(t), 

j=l 

where  g(x,t)  is  a bounded  measurable  function  on  x [0,T];  a],  i = 1,  2,  ..., 
n],  and  bj,  i = 1,  2,  ...,  n2,  are  constants  while  the  <t)|(t),  <t)j  (t)  are  bounded 
L2  [0,T]  functions. 
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|q(x,t)||Q  is  the  x L2[0,T]  norm  given  by 


llq(x,t)  IIq  = sup  I (q,W)  | 

IIwIIl 

2 


The  admissible  loads  q(x,t)  e Q obey  the  constraint  ||q(x,t)|l  q - M 
fixed  constant  M > 0. 

The  deflection  function  W(x,t)  obeys  the  state  equation 


(0.4) 


9^W  (s(x);  x,t) 


9t 


= q(x,t) , 

where  q(x,t)  is  an  admissible  load,  and  S(x)  an  admissible  design, 
can  be  rewritten  as  a system 


M(x,t)  = - E I 


(s(x)  ,xj 


2 

9 W 


^S(x) ; x,tj 


9x 


(0.5) 


a(s(x),x) 

3x 


'W^S(x)  ;xtj 


= q(x,t) 


9t 


The  boundary  conditions  are 


(0.6) 


W ^S(x) ; X, = 0 j for  al 1 t € [0 .T]  , 


M (x,t)  = 0 


X = -I,  or  X = +£. 


for  some 


Equation  0.4 
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The  initial  conditions  in  most  realistic  problems  are  not  known.  Here,  we 
assume  for  the  sake  of  simplicity  that 


(0.7) 


W ^S(x)  ,x,ojs  0. 
M (x,0)  = 0. 


1 . STATEMENT  OF  THE  PROBLEM 


We  consider  a cost  functional  J (s{x)  ; W^S(x)j;  q(x,t)  ; tj  and  we  wish  to 
find  a design  S(x)  e SP,  such  that 


S(x) 


min  max  <Sup  (j|S(x);  W(s (x)\  ; q (x , t)  ; t| 


Ss^‘-te[0,T]  'qcQ 


For  example,  we  may  wish  to  minimize  the  total  energy 


J 


^S(x) ; W^S(x) ; q(> 


e (S,q,t) 


Our  problem  consists  in  finding  S(x)  e ,9^  such  that 


(1.1) 


e (S,q,t)  = 


m I n 


max 
lt€[0,t] 


/sup 

\q€Q 


(S;q,t) 


In  what  follows  we  shall  always  assume  that  the  set  Q of  admissible  loads  Is 
bounded  in  the  Q-norm  and  convex.  For  the  sake  of  convenience  let  subscripts, 
from  now  on,  denote  partial  differentials,  i.e.. 


W 


X 


9W 
9x  ’ 


9t  ’ 


etc. 


To  prove  some  preliminary  lemmas  we  need  the  following  known  identity 
(for  stated  boundary  conditions) 


Hence, 


(1.3) 


_d_ 

dt 


W (q(x,t) 


Clearly,  if  sign  q(x,t)  = sign  W^.  (x,t)  for  some  x e on  a set  of  positive 
measure  in  fJ,  and  q (x,t)  = 0 elsewhere  in  fi,  we  must  have 


(1.^) 


d 

dt 


ll|w(q(x.t))|!|  ^ 


> 0. 


Of  course,  we  do  not  know  whether  such  design  exists.  However,  the  following 
lemma  is  easy  to  prove. 

Lemma  1 

For  any  fixed  t € [0,T]  and  fixed  S sup  £(S;q,t)  exists  and  for  a 

fixed  S qeQ 

max  /sup  e(S,q,t)\  = sup  e(S,q,T). 

te[0,T]  \^q€(J  ~ J qeO.  ~ 

Proof 

The  first  statement  follows  trivially  from  boundedness  of  the  set  of  all 
numbers  e(S,q,t),  S c q e 0,  t € [0,T]. 

The  second  statement  can  be  proven  as  follows.  Suppose  that  for  some 
choice  ofqcQ,  = max  E(S,q,t)  > 0 occurs  at  some  time  t = t<T. 

° t€[0,t] 

Let  us  suppose  that  force  q = 0 is  applied  to  the  beam  on  a time  interval 
(t,t]),  t < t]  < T.  Clearly,  t]  can  be  chosen  so  that  W^(x,tj)  / 0 on  some 
subset  of  containing  an  interval  [x] , X2]  € fi.  Applying  an  admissible  load 
q(x,t)  such  that  either  sign  q(x,t)  = sign  Wt(x,t)  on  [x| ,X2l  and 
q(x,t)  = 0 otherwise,  or  q(x,t)  = 0 on  some  subset  of  [x] ,X2]  for 
t e [t|,t2]  C [tj,T]  with  [t],t2]  chosen  sufficiently  small,  we  obtain 
e(S,q,t2)  > Eq  which  is  a contradiction,  proving  our  contention  that  t = T. 
Hence,  the  problem  is  reduced  to  finding  a design  S e ^ such  that 

e (S,q,T)  = min  /sup  e (S,q,T) 
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2.  PONTRJAGIN'S  PRINCIPLE 


For  a fixed  design  S the  extremal  control  q € Q satisfies  Pontrjagin's 

maximaHty  principle  in  the  form  given  in  References  1 and  2.  Specifically, 
let  W|^(S,x,t)  be  a solution  of  the  homogeneous  equation 


where  W|^  obeys  the  boundary  conditions  of  Equation  0.6  at  x = ±%,  and  satisfies 
an  energy  condition 


(2.2)  e ^ 

for  any  other  solution,^W^  ^S(x);x,tj  satisfying  the  same  boundary  conditions. 
(Note  that  the  design  S(x)  is  fixed!) 

Then  the  following  inequality  must  be  true  for  all  values  of  time  on  the 
interval  [0,T] 


'+5, 


(2.3) 


■q(x,t)W^  (S(x);x,t)dx  = max 

*^t^~  > qeQ 


-q(x,t)W^  ^S(x)  ;x,tyj 


dx 


for  an  extremal  control  q(x,t).  Let  us  make  the  following  comments.  For  a 
fixed  S € gp  the  final  state  WjS(x),x,Tj,  which  maximizes  the  total  energy  at 
t = T , is  not  un i que . ' ' 


3.  DUHAMEL's  PRINCIPLE 

This  differs  sharply  from  the  theorem  established  in  Reference  2 for 
optimal  controls  minimizing  the  final  energy.  The  reason  for  this  difficulty  is 
the  nonconvexity  of  final  states.  To  prove  some  properties  of  extremal  energy 
states  we  proceed  to  postulate  a strong  form  of  Duhamel's  principle.  There 
exists  a kernel  function  G(x,E,t,T),  which  depends  only  on  the  coefficients  p, 

E,  1,  and  A of  the  Equation  0.4  and  on  the  assigned  boundary  conditions,  but 
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is  independent  of  the  initial  data  and  of  the  inhomogeneous  term  q(d,t),  and 
such  that  the  solution  of  Equation  O.A  can  be  written  as 


(3.1) 


W(x,t) 


j^G(x,E,t,T)  q(5,T)JdCdx 


W|^(x,t) 


For  example,  in  the  static  deflection  case  for  a freely  supported  beam, 
this  kernel  is  of  the  form 


(3.2) 


+ 5x 


(1  - r\/l)' 
El  (n) 


dn 


with  the  kernel  G(5,x)  defined  on  [0,£]  x [0,i]  by  the  relation; 


(3.3) 


G(5,x)  = c(5,x)  if0^5^x^£ 


and 


G(?,x)  = c(x,E)  if  0 ^ X ^ 5 ^ £ 


The  solution  of  the  homogeneous  equation  Wj^(x,t)  satisfies  the  stated  boundary 
cond i t i ons . 
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Lemma  3-1 


We  assume  a fixed  design  S € . Suppose  that 
extremal  admissible  loads  maximizing  the  energy  e at 
initial  conditions.  Then  the  corresponding  extremal 
W2(x,t)  obey  the  condition  <W|  , W2>  ^ | t;=T  ^ 0. 


q] (x,t) , q2(x,t)  are 
the  time  t = T,  with  zero 
displacements  W](x,t), 


Proof 


We  use  the  convexity  of  the  set  of  admissible  loads.  Hence,  if  q]  , q2 
are  admissible  l/2(q]  + q2)  is  admissible.  ^Using  Formula  3.1,  we  conclude 
that  the  corresponding  displacement  is  1/2(W]  + W2) , where  W] , W2  correspond 
to  the  loads  q]  , q2 , respectively.  (We  recall  that  W|^(x,t)  e 0 in  the 
Formula  3.1  because  of  our  assumption  concerning  initial  condit[ons.)  Suppose 
that  qWS)  , q2(S)  are  two  external  loads  such  that  e(q][S)  ,T)  = e(q2(S),T) 

- e (q(sy,T)  for  any  admissible  load  q(x,t)  € Q.  Let  W](S(x),x,t)  be  the  dis- 
placement corresponding  to  q^  and  W2(S(x) ,x,t)  displacement  corresponding  to 
q2.  We  compute 


-(W 

2^1 


«2>)  t. 


1_ 
T 2 


(<W, 


2<W^ ,W^>  + <W 


2’ 


/S  V 

^2^)t= 


(e  + 2 <W, 


,W^> 


A 


= e(T)  + 2 <W, 


Since  e(T)  is  the  maximal  attainable  energy  level  at  t = T,  we  have 


<W, 


W > 

2 filt=T 


^ 0. 


Lemma  3.2 

The  final  state  at  t = T corresponding  to  a maximal  optimal  control  with 
zero  initial  condition  results  in  a first  eigenmode  vibration  if  the  beam 
vibrates  freely  on  any  interval  [T,  T^]  (with  q = O) . 

Out  line  of  Proof 

We  shall  assume  that  q e Q implies  HI  q 111  q - 1.  Let  us  suppose  that  the 
final  state  {W(x,T) , cannot  be  attained  as  a state  corresponding  to 

the  first  eigenfunction  of  the  system  (j)i(x,t). 
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The  homogeneous  solution  W^Cx.t)  such  that  W|^(x,T)  = W(x,T)  and 
Wh^(x,T)  = Wt(x,T)  can  be  expanded  in  terms  of  eigenfunctions  of  the  operator 

given  by  Equation  0.4  on  the  ray 

00 

(T,“>)  • W^(x,t)  = C.cj).  (d,t)  , 
i = l 


where  {(l),(x,t)}  form  an  orthonormal  system. 

We  only  need  to  show  that  some  extremal  admissible  load  (i.e., 

|lq|l=  1)  brings  the  system  to  rest  at  t = 2T  starting  at  energy  level  e = e at 
t = T,  but  that  system  with  identical  initial  energy  is  controllable  on  [T,2T] 
if  the  initial  state  corresponded  to  the  second  (or  higher)  eigenfunction. 

This  is  a technical  computation  comparing  the  work  done  by  opposing 
maximal  velocity  point  with  a Dirac  delta  force  over  the  interval  [T,  2T]  as 
the  beam  vibrates  in  a second  fundamental  mode.  The  key  to  this  argument  is 
the  fact  that  the  mean  velocity  over  one  cycle  of  vibration  is  lower  at  the 
point  of  maximum  velocity  than  the  one  measured  in  the  fundamental  mode  of 
vibration.  The  proof  of  this  technical  lemma  follows  some  arguments  of 
D.  L.  Russell  and  will  be  published  elsewhere. 


Theorem  3. 1 

Let  us  assume  a design  S which  does  not  correspond  to  the  coalescing 
point  of  first  two  fundamental  frequencies.  Then  the  (maximal)  optimal  control 
q(x,t),  X € t €•  [0,T],  maximizing  the  total  energy  of  a beam  at  t = T, 
with  the  state  function  W(x,t)  satisfying  zero  initial  conditions,  results  in 
a unique  finite  condition  (W(x,T) , W^(x,T}  at  t = T within  a plus  or  minus 
sign. 


Proof 

This  theorem  can  be  regarded  as  a corollary  to  Lemmas  3.1  and  3.2. 
Clearly,  if  q(x,t)  is  optimal,  then  so  is  -q(x,t).  Consequently,  it  is  easy  to 
show  that  any  optimal  displacements  W2 » ^2  corresponding  to  linearly 
independent  controls  q]|  and  q2  satisfy  orthogonality  condition 


<W, , W^> 


1 t=T 


= 0. 
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Choosing  -q2  as  an  admissible  control,  the  displacement  -W2  corresponding  to 
-q2  satisfies  the  inequality 


Hence , 

(w  , W ) = 0. 

■ “lt=T 


Lemma  3-2  indicates  that  either  W|  or  W£  is  the  state  corresponding  to  a funda- 
mental frequency  . Hence,  without  any  loss  of  generality  suppose  that 
W](x,T)  = (j)]  (x,t)|  |-=-|-,  (j)](x,t)  corresponding  to  fundamental  frequency  . Then 
W2(x,t)  cannot  be  optimal  unless  it  corresponds  to  the  same  eigenvalue,  i.e., 
the  same  frequency  . Since  this  case  is  precluded,  the  proof  is  complete. 


4.  SIMULTANEOUS  OPTIMIZATION  AND  CONTROL 


The  Direct  Approach 


As  before  we  seek  a minimum  over  the  set  of  optimal  designs  S of  the 
maximal  total  energy  e(T)  attained  by  an  admissible  control  q e Q.  We  assume 


llq  [L  ^ 1 . The  theorem  on  uniqueness  of  t 
i ng  a direct  computational  technique  and 
design  S e ,9^  let  C(t,S(x))  denote  the 


e finite  state  is  crucial  in  establish- 
in  estimating  errors.  For  a given 


where  W|^(x,t)  obeys  the  unique  finite  conditions  at  t = T . We  make  use  of  weak 
compactness  of  the  set  of  admissible  finite  states  for  each  design  s e ,9^  to 
assert  the  existence  of  C(t,s(x))  for  almost  all  t « [0,T].  If  constraints 
are  of  the  form 


(4.1) 


W(S(x,t)jdx  ^ 0 


i = 1,2, 


r, 
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then  at  each  Sq  and  t e [0,T],  which  is  a regular  point  of 


q(x,t)  (x,t)  dx, 


use  of  Lagrangian  multipliers  can  be  justified  and  we  could  consider  minimiza- 
tion over  SP  of  the  functional 


r 

(4.2)  J(S)  = e^S(x),  q(x,t)  ,T^  + ^ ' y.<t>^S(x)  ,t]  + p^(  t)  C ^t ; S (x)j 

i=i 


with  p|  - 0 i = 1,2,  ...,  r,  and  Pg  - at  each  regular  point  t e [0,T]. 


Note 

A number  of  known  "tricks  of  the  trade"  can  be  employed  to  change  local 
constraints  to  a global  form  Equation  4.1,  (See  References  3 or  4.) 

If  one  assumes  the  existence  of  a simple  fundamental  frequency  (D](S), 
then  using  recent  results  of  Haug  and  Rousselet  (Reference  5)  one 
can  prove  Frechet  differentiability  of  the  differential  operator  L with 
respect  to  the  design  vector.  (Actually,  one  can  prove  more  than  that, 
but  detailed  results  are  not  needed  here.)  Therefore,  it  is  legal  to 
proceed  with  formal  differentiation  with  respect  to  the  design  vector 
function  S = {S|(x)  ...Sp(x)}.  Regarding  6S  = {6S]  , <5Sp|}  as  an 

arbitrary  increment,  such  that  Sg  + 6S  is  admissible,  we  could  copy  the 
approach  given  by  Komkov  and  CoTeman  in  Reference  6. 

The  formal  differentiation  procedure  produces  incorrect  designs  if  the  two 
lowest  eigenvalues  coalesce,  thus  leading  to  erroneous  conclusions.  This  is  a 
crucial  point  in  the  computational  techniques  based  on  heuristic  arguments. 
Optimization  results  in  numerical  minimization  of  weight  at  constant  eigen- 
frequency  led  in  the  past  to  many  difficulties  including  appearance  of  singu- 
larities, discontinuities  in  design,  and  questionable  local  optima. 

Bearing  this  in  mind,  we  now  proceed  to  optimize  the  design  in  the 
simplest  possible  case,  when  the  structural  system  (in  our  case  a single  beam) 
is  statically  determinate.  The  bending  moment  M(x,t)  does  not  depend  on  the 
design  S;  it  depends  only  on  the  load  q(x,t) , which  maximizes  e (T) . 
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Identifying  the  potential  energy  with  strain  energy,  we  use  the  Hooke's 
law  formula  to  evaluate  s (t) . 


Since  M is  independent  of  S(x)  we  can  evaluate  the  sensitivity  vector  dJ/dS€B* 
(the  dual  of  the  normed  space  B of  which  ^ is  a bounded,  closed  convex  subset). 


We  compute  formally 


where 
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Since  the  last  term  contains  only  design  constraints  which  are  t ime- i ndependent 
and  the  first  term  is  specifically  evaluated  at  t = T,  we  conclude  that  the 
term 


is  also  time  independent  and  can  be  evaluated  at  t = T.  Hence,  for  statically 
determinate  cases  we  can  design  a fairly  simple  computational  algorithm  similar 
to  the  one  offered  in  Reference  6 allowing  us  to  compute  iteratively  the 
optimal  control  and  the  improvements  in  design.  A simple  example  of  such 
computation  was  carried  out  in  Reference  6 for  the  case  of  energy  minimizing 
control . 

Our  results  of  Sections  2 and  3 indicate  that  this  technique  is  appro- 
priate to  optimal  excitation,  that  is,  to  energy  maximizing  controls.  The 
lack  of  convexity  apparently  does  not  destroy  the  effectiveness  of  the  basic 
theory  based  on  Pontrjagin's  principle.  Basically,  these  results  duplicate  the 
conclusions  made  in  Reference  6,  but  in  a theoretically  more  difficult  case. 
However,  a check  must  be  maintained  on  absence  of  multiple  eigenvalues  when  a 
change  in  design  causes  a corresponding  change  in  W^^(x,t). 


5.  THE  ADJOINT  VARIABLE  TECHNIQUES 
We  consider  a weak  form  of  the  state  Equation  0.^,  i.e., 
(5.1)  <A,  ^L(S)W^S(x)  ,x,t^  - q(x,t)^>^  = 0, 


subject  to  boundary  conditions  and  spatial  constrains 


. ^x,S(x)  ,W(x,t)j  = ^.(x)  = 0, 


i = 1,2, 


It  is  known  that  solutions  exist  to  Equation  O.A  in  the  Sobolev  space  H whose 
structure  is  determined  by  energy  inequalities  of  Equations  0.2  and  0.3,  if 
one  prescribes  Cauchy  data,  such  as  W(x,0)  = <|)(x)  e cl  (fi)  , W^(x,0)  = 

e Cl(S]),  and  suitable  boundary  conditions,  such  as,  for  example,  free  sup- 
port at  X = ±Si.  Hence,  Equation  5.1  is  satisfied  for  any  choice  of  sufficiently 
smooth  A(x,t) . Let  J denote  the  functional  given  by  Equation  4.2. 
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A satisfies  the  equation 


(5.2) 


8a(vi.  ,(().  ,J  ,S) 
3W 


in  X [0,T]  and  satisfies  terminal  condition  A (x,T)  = 0. 

The  invariance  of  state  equation  when  the  design  is  perturbed  is 
expressed  by  the  condition 


(5.3) 


L W>, 


<A,  = 0. 


Since  A does  not  depend  on  S,  Equation  5.3  can  be  rewritten  as  a system 
of  r-equations 


(5.4) 


<A,  L-W>  + <L*A,  W->  - <A,q^>  = 0, 


where 


'S  • 


- 3 (c  3I(S)\  3 


3x 


+ P 


3A(S)  / 3' 


Ut^ 


Recalling  Equation  5.2,  we  note  that 


*"i’  M.U  * 

dS  9W  S 3S  • 


Hence , 


<L*A,  W2> 


'3a. 

'3W 


Ws> 


da  _ 9a 
dS  " 9S  ■ 
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The  very  important  term  da/dS  is  essential  in  deciding  sensitivity  of  the  cost 
functional,  in  this  case  of  the  total  energy  at  final  time.  We  observe  da/dS 
can  be  computed  if  one  knows  the  state  function  W(x,t) , the  solution  A of  the 
adjoint  Equation  5*2  and  the  multipliers  y. 

The  most  troublesome  term  Wg  has  been  removed.  One  could  immediately 

apply  gradient  projection  technique  perfected  for  this  problem  by  Haug, 

Arrora,  and  Feng  (Reference  7) > if  we  apply  a fixed  control  q(x,t).  For 
example,  choosing  a specific  design,  computing  the  natural  frequency,  fixing 
a maximum  weight  of  the  beam,  the  maximum  permissible  displacement,  and  the 
minimum  sizes  of  the  beam,  we  adopt  a load  which  is  a Dirac  delta  function 
applied  to  the  free  end  of  a cantilevered  beam,  with  frequency  input 
cos  (80  TTt)  , for  t > 0.  The  constants  were  taken  to  be  E = 30*lo6  psi  , 
ct  a2  = 1 , a = . 3,  «-  = 40". 

This  problem  was  analyzed  at  the  University  of  Iowa  on  an  IBM  360/65 
computer.  The  beam  was  divided  into  ten  finite  element  sections. 


The  optimal  shape  of  the  beam  is  illustrated  above.  Changes  due  to  the 
term  C(S,t),  which  is  absent  in  the  Iowa  analysis,  do  not  appear  to  affect 
seriously  the  optimal  shape.  We  recall  that  the  term 


C(s,t)  - W^  (x,t) 


q(x,t) 


is  independent  of  time  and  only  depends  on  frequency  and,  therefore,  on  S. 

Small  changes  in  design  and  small  changes  in  optimal  control,  particularly  if 
the  first  two  eigenvalues  of  the  operator  L are  not  close  to  each  other,  do 
not  affect  seriously  the  sensitivity,  or  the  iterative  techniques  incorporating 
changes  in  control. 

Numerical  studies  indicate  that  assignment  of  constraints  on  minimum 
sizes  of  admissible  designs  prevents  this  difficulty.  However,  any  numerical 
studies  must  incorporate  the  study  of  changes  in  spectrum  of  the  operator  L. 
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6.  A BRIEF  DISCUSSION  OF  APPLICATIONS 


A question  which  engineers  find  hard  to  answer  concerns  the  design  of  a 
system.  Is  the  design  "good"?  Generally,  a check  is  made  to  determine 
whether  the  design  is  safe  and  reasonably  economical.  Traditionally,  little 
effort  was  made  to  improve  a "satisfactory"  design.  However,  in  the  last 
ten  years  a concentrated  effort  was  made  to  develop  a theory  and  algorithms 
for  iterative  improvements  of  even  very  complex  mechanical  and  structural 
systems,  particularly  of  systems  modelled  by  equations  with  distributed 
parameters.  Some  of  the  most  difficult  theoretical  problems  arise  almost 
casually  In  any  consideration  of  what  is  a "good"  design.  Questions  of 
identification  (i.e.,  how  good  is  the  mathematical  model),  of  reliability  of 
data,  of  our  knowledge  of  physics  of  the  basic  processes,  and  specifically  of 
the  loads  applied  to  the  system  can  never  be  answered  completely  or  satis- 
factorily. In  most  structural  systems,  at  least  in  some  part,  the  loads  are 
unknown  or  random.  When  designing  structures  to  withstand  random  wind  gusts, 
earthquake  loads,  or  unpredictable  dynamic  loading  from  other  causes,  one  must 
design  the  structure  for  the  worst  set  of  conditions  encountered  in  past  years 
or  expected  in  the  foreseeable  future. 

Clearly  one  cannot  design  for  some  average  loads  or  average  winds.  A 
reasonable  assumption  is  to  study  the  "worst"  data  available.  The  techniques 
outlined  in  this  paper  apply  to  basic  concepts  of  engineering  design  if  it  is 
assumed  that  Mother  Nature  conspires  to  present  the  worst  possible  set  of 
controls  over  a reasonable  time  period  [0,T]  corresponding  to  a duration  of 
extreme  wind  gusts,  of  an  earthquake,  or  of  some  other  extreme  dynamic  loads. 
Even  then  our  assumptions  are  open  to  many  criticisms;  but  so  are  other 
assumptions  when  one  tries  to  model  and  predict  physical  phenomena. 
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ABSTRA.CT 

This  paper  reviews  a formulation  for  the  design  of  fixed-order  dynamic  com- 
pensation for  flexible  mechanical  systems  which  is  based  upon  explicit  optimality 
conditions  derived  under  the  minimum  data/maximum  entropy  stochastic  modelling 
approach.  The  optimality  conditions  give  rise  to  the  novel  concept  of  "optimal 
projection"  which  provides  significant  insight  with  regard  to  the  compensator 
structure  and  permits  earlier,  more  restricted  and/or  ad  hoc  design  schemes  to  be 
seen  as  special  cases  of  the  present  formulation. 

1.  INTRODUCTION 

A satisfactory  theory  of  structural  control  design  for  large,  flexible  space 
systems  must  secure  desirable  system  properties  in  the  face  of  inevitable  errors 
arising  in  the  construction  of  large  order  structural  models  and  should  success- 
fully address  the  optimization  of  implementable  dynamic  compehsation.  Ref.  [1] 
reviewed  recent  developments  leading  to  the  minimum  data/maximum  entropy  approach 
to  modelling  and  mean-square  optimization.  Accepting  system  parameter  uncertain- 
ties at  their  a priori  levels,  this  formulation  incorporates  such  uncertainties 
directly  within  the  design  process  by  the  use  of  an  explicitly  stochastic  design 
model  of  the  plant.  By  virtue  of  the  underlying  maximum  entropy  principle,  the 
resulting  stochastic  design  approach  acknowledges  the  minimum  possible  prior 
information  on  parameter  statistics  and  is  capable  of  directly  securing  robust 
stability  and  mean-square  optimality. 

The  design  ramifications  of  this  modelling  approach  in  the  case  of  full-state 
feedback  regulation  are  summarized  in  the  companion  paper^.  This  paper  addresses 
further  optimization  issues.  In  particular,  we  review  the  development,  under  the 
minimum  data/maximum  entropy  model,  of  optimality  conditions  for  fixed-order 
dynamic  compensation.  It  will  be  seen  that,  quite  apart  from  the  Inclusion  of 
parameter  uncertainties,  the  present  formulation  of  mean-square  optimal,  fixed- 
order  compensation  is  novel.  In  particular,  explicit  optimality  conditions  are 
made  available,  and  a rigorous  and  tractable  design  scheme  for  implementable  com- 
pensation is  obtained.  The  compensator  form  which  naturally  emerges  from  this 
formulation  is  fully  defined  by  the  gains  and  by  a projection  (an  idempotent 
matrix)  whose  row  and  column  spaces  are  the  observation  and  control  subspaces  of 
the  compensator.  The  optimality  conditions  given  herein  are  of  such  a form  that 
they  determine  this  "optimal  projection"  together  with  the  optimal  gains.  Since 


* This  work  is  sponsored  by  the  Department  of  the  Air  Force.  The  U.S.  Government 
assumes  no  responsibility  for  the  information  presented. 
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the  compensator  control  and  observation  subspaces  are  not  restricted  at  the  outset 
to  consist  only  of  selected  modal  coordinates,  the  present  formulation  achieves 
greater  design  flexibility  than  the  more  conventional  approaches. 

After  setting  forth  the  basic  maximum  entropy  design  model  in  the  following 
section,  we  present  the  main  optimality  and  stability  results  in  Section  3. 

Section  4 describes  the  relations  between  the  formulation  considered  here  and 
earlier  treatments  of  more  idealized  controller  forms.  The  remaining  sections 
offer  various  geometric  interpretations  and  elucidate  the  approximate  character 
of  the  "modern  modal  control"  approach  (modal  truncation  followed  by  LQG  design) 
in  the  light  of  the  present,  more  general  developments. 

2.  MAXIMUM  ENTROPY  MODELLING  AND  MEAN-SQUARE  OPTIMIZATION 

Consider  control  design  for  a structural  system  modelled  by  n of  its  elastic 
modes.  Extensions  needed  to  include  rigid  body  modes  are  straightforward  and  will 
not  be  explicitly  considered  here.  The  system  equation  assumes  the  form: 


5 = yC  + Bu  + w^ 

a.  1 

n = y5  + w^ 

b.  (1) 

e e , u e R^,  n e 

c.  ) 

where  ? is  the  plant  state,  y is  the  plant  dynamics  map  and  B and  y are  the  con- 
trol input  and  sensor  output  maps,  respectively.  Also,  w]^  is  a white  disturbance 
noise  with  intensity  matrix  V]^  > 0 and  W2  is  observation  noise  with  intensity 
V2  > 0. 

Due  to  various  sources  of  error  arising  in  the  process  of  structural  model- 
ling. there  may  exist  a priori  uncertainties  in  y , B and  y.  It  is  supposed  here 
that  (1)  is  written  in  the  elgenbasls  of  the  structural  system  with  the  nominal 
or  expected  values  of  the  parameters.  In  this  case  as  Refs.  [2,3]  show,  a wide 
class  of  parameter  uncertainties  (embracing  errors  in  the  structural  stiffness 
matrix  and  in  actuator  and  sensor  placement)  may  be  represented  by  skew-hermit ian 
perturbations  in  y alone.  Thus,  arranging  the  state  vector  appropriately,  y 
assumes  the  form: 

y = y + v(t)  , V = -v^  (2) 

where  v is  a zero-mean  random  matrix  and  y is  the  expected  value: 

y = dlag{aj^(i-n^),w^(-i-n2),- ••  ,w„(i-n^),w^(-i-nj^)}  (3) 

with  and  n^  the  nominal  modal  frequencies  and  damping  ratios,  respectively. 

While  explicitly  recognizing  the  stochastic  character  of  the  plant  dynamics 
map,  it  is  desired  to  secure  a controller  which  is  optimal  in  an  averaged  sense 
by  selecting  the  input,  u(t) , to  minimize  the  mean-square  (steady-state)  perform- 
ance: 
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J = lim  j;"-  / ^ dt  E[?^r  5 + u'^r  u] 

t^-t  +«=  1 o ‘"o 

1 o 

> 0 , > 0 

where  the  ensemble  average,  E[,],  embraces  the  v-ensemble. 

In  addressing  this  stochastic  design  problem,  it  must  be  recognized  that  a 
complete  probabilistic  description  of  v(t)  is  never  accessible  to  the  design 
analyst  from  the  necessarily  incomplete  empirical  statistical  data.  To  resolve 
this  difficulty.  Refs.  [4,5]  Introduced  the  use  of  a maximum  entropy  principle 
for  the  construction  of  a full  probability  model  which  is  consistent  with  avail- 
able data  while  being  maximally  unconstrained  otherwise.  Moreover,  the  work 
reviewed  in  Ref.  [1]  identified  a minimum  set  of  parameter  statistical  data  which 
must  be  acknowledged  as  empirically  available  in  order  that  a well-defined  prob- 
ability model  may  be  induced  via  the  maximum  entropy  principle.  The  minimum  data/ 
maximum  entropy  model  is  thus  literally  the  maximum  entropy  model  induced  by  this 
minimum  data  set. 

Since  the  maximum  entropy  modelling  approach  entails  a measure  of  design 
conservatism  and  implicitly  imposes  robustness  constraints  without  recourse  to 
ad  hoc  procedures,  we  adopt  its  use  here  for  the  modelling  of  v(t)  in  (2). 

Although  all  of  the  following  results  can  be  obtained  or  trivially  extended  to 
the  general  formulation  outlined  in  Ref.  [1],  we  limit  consideration,  for  sim- 
plicity in  the  exposition,  to  uncertainties  in  the  open  loop  frequencies  of 
system  (1): 

v(t)  = diag  [i  Jm(v^^)6^(t) ] (5) 

k 

where  Im(.)  denotes  the  imaginary  part. 

Within  the  maximum  entropy  model,  and  by  virtue  of  (5),  system  (1)  has  the 
Ito  differential  representation: 


d?(t)  = y_5(t)  dt  + 3u  dt  + dv(t)g(t)  4-  dW,  (t)  j 

} (6) 

dn(t)  = Yd?(t)  + dW^Ct)  ) 

where: 

M = y -hi  , I = diag  7^  (7) 

k I \ J 

where  T>  is  the  so-called  "relaxation  time"  associated  with  the  dj^(t)  in  (5). 
Assigned  numerical  values  for  the  constitute  the  minimum  data  set  in  this 
case  and,  referring  to  the  expressions  given  in  Ref.  [1] , it  is  seen  that  the  Tj^ 
are  real,  non-negative  and  inversely  proportional  to  the  overall  magnitudes  of 
the  frequency  deviations. 


Within  the  above  conditions,  the  stochastic  design  approach  consists  in 
accepting  (6)  as  the  plant  model  and  then  determining  u(t)  to  minimize  Jg  of  (4). 
This  plan  has  been  elaborated  for  a variety  of  specific  controller  forms.  In 
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particular,  for  full-state  feedback  regulation  as  well  as  full-order  dynamic  com- 
pensation, Refs.  [6,7]  succeeded  in  showing  various  desirable  properties  of  the 
stochastic  design  approach  - notably,  guaranteed  closed-loop  stochastic  stability 
under  the  stochastic  design  model  and  the  automatic  emergence  of  an  inherently 
robust  control  law  for  high  levels  of  modelled  uncertainty.  These  and  additional 
results  described  in  Ref.  [1]  strongly  motivate  extension  of  the  approach  to  con- 
sideration of  fixed-order  compensation. 

3.  FIXED-ORDER  COMPENSATION:  BASIC  FORMULATION  AND  OPTIMALITY  CONDITIONS 

Quite  apart  from  the  issues  connected  with  parameter  uncertainty,  there  is  a 
need  for  re-examination  of  previous  methods  for  fixed-order  compensator  design. 
Perhaps  the  earliest  approach  to  the  design  of  fixed-order  compensation  for  flex- 
ible mechanical  systems  Involves  the  retention,  in  the  design  model,  of  relatively 
few  modal  coordinates  and  the  use  of  familiar  LQG  techniques  for  the  reduced-order 
model.  As  is  presently  well  known,  this  scheme  spawns  a host  of  spillover  insta- 
bility problems.  Furthermore,  various  techniques  advanced  over  the  past  few  years 
which  retain  the  use  of  modal-coordinate  space  decompositions  (i.e. , reduced-order 
modal  coordinate  models)  for  the  purpose  of  spillover  suppression^ , 9 or  model 
reduction^®’^!  are  themselves  subject  to  unwanted  spillover  effects  and  cannot 
guarantee  stability  or  robustness  for  the  full  order  system. 

From  the  above  considerations.  Increasing  attention  has  been  directed  to  the 
general  approach  (originated  by  Newton,  Gould  and  Kaiser^^)  which  accepts  a high 
order  model  of  the  system  while  limiting  the  compensator  order  to  some  fixed  number. 
Using  the  technique  recommended  by  Kwakernaak  and  Sivan  for  gradient  computations 
together  with  a parameter  optimization  algorithm,  Martin  and  Bryson^^  have  recently 
applied  this  approach  to  flexible  spacecraft  control.  The  formulation  of  Martin 
and  Bryson  assumes  a positive  definite  weight  (in  the  usual  quadratic  performance 
index)  on  the  control  input  and  permits  feedthrough  only  for  the  non-noisy  portion 
of  the  sensor  output.  In  marked  contrast,  Johnson^ allows  unrestricted  sensor 
output  feedthrough  and  places  no  weight  on  the  control  input.  Actually,  these  two 
formulations  are  extreme  cases  of  a more  general  scheme  which  allows  limited  feed- 
through combined  with  limited  control  input  weighting  in  a manner  which  avoids 
singularity  in  the  performance  index.  This  general  situation  is  certainly  of 
interest,  but,  to  fix  ideas,  we  consider  the  problem  formulation  of  Martin  and 
Bryson  under  non-singular  observation  noise. 

Thus,  consider  the  closed-loop  system  equations  comprising  (6)  for  the  plant 
together  with: 

u = -Kq  a.  I 

q = aq  + fp  b.  > (8) 

N I 

q e ^ ^ ; N^  < 2n  c.  J 

where  ic  and  f are  the  (time-invariant)  compensator  gains,  q is  the  compensator 
state  (of  dimension  N„  < 2n)  and  a the  corresponding  dynamics  map.  With  these 
restrictions,  we  should  like  to  determine  k,  f and  a to  minimize  J^. 
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Although  the  parameter  optimization  approach  to  this  problem  is  reasonably 
well  developed  (at  least  for  the  case  of  a deterministically  parametered  plant) 
and  its  computational  burden  does  not  seem  prohibitive,  it  yields  little  theo- 
retical insight  and  no  guarantee  of  global  optimality.  Thus  there  is  a need  for 
explicit  optimality  conditions  of  a form  analogous  to  the  results  of  Levine, 
Athens  and  Johnson^^  for  the  optimal  output  feedback  problem. 


Such  conditions  were  derived,  for  the  first  time,  in  Ref.  [16].  Of  course, 
the  stationary  conditions  initially  obtained  comprised  a system  of  non-linear 
matrix  equations  of  formidable  complexity.  However,  the  condition  that  the  gradi- 
ent of  the  performance  index  with  respect  to  elements  of  the  compensator  dynamics 
matrix  vanish  results  in  an  Identity.  In  fact,  the  stationary  conditions  furnish 
no  determination  of  the  compensator  dynamics  matrix  and  there  is  thus  considerable 
latitude  of  choice.  It  was  shown  that  by  judicious  choice  of  the  compensator 
dynamics,  the  originally  very  complex  stationary  conditions  may  be  reduced  to 
relatively  simple  forms.  Defining: 


(M)  . diag  [M^^, 

for  any  M e and  introducing  the  notation: 


a = 3t2  3 


-AH  -1 
a = Y V2  Y 


(9. a) 


(9.b) 


the  final  results  of  Ref.  [16]  which  are  the  object  of  the  present  discussion  may 
be  summarized  as  follows: 

Theorem  1 - Consider: 


0 

0 


U UA 

(p  -oPt)  P + P(p  -oPt)  + I{P  + T Px}  + r, 
mm  1 

(y  -xQa)Q  + Q(y  -TQa)®^  + I{Q  + xQx^}  + v., 
mm  I 


+ t^PoPt 


+ tQoQt 


a. 


b. 


(10) 


where: 


TJ  IJ  IJ/\  IJ/V  

0 = T [(y^-Qo)  T Px  + X Px(y^-Qo)  + PoP] 

0 = X [ (y  -oP)xQx^  + xQx^(y  -oP)'^  + QoQ] 
m m 


A 


X 


g«r 


N x2n 

g,T  e t 


b. 


(11) 


(12. a) 


such  that  r and  g satisfy: 

rg^  = (12. b) 

9 A A 

If  r,  g and  positive  semi-definite  P,  P,  Q and  Q exist  satisfying  (10)-(12), 
then  with  the  choice: 


a = r(y^-fY  -3x)g^  (13) 

for  a in  (8.b) , the  gains: 

AH  ^ 

K = K g , f = rf  (14) 
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where: 


A -1qH„  ^ n H -1 

K = 3 P , f = Qy 


(15) 


determine  an  extremum  of  the  performance  index  (4)  under  the  constraints  imposed 

by  (6)  and  (8).  The  extremum  value,  J*  of  J is  given  by: 

s s 


uA  — 

J*  = tr[P  v^  + T Pt(v^  + QoQ) ] 

Proof:  See  Theorem  2 and  Corollary  1 of  Ref.  [16]. 


(16) 


As  is  evident  from  (13)  the  present  approach  does  not  fix  the  structure  of 
the  compensator  dynamics  in  advance  but,  rather,  determines  it  together  with  the 
gains.  The  consequent  form  of  the  closed-loop  system  under  the  maximum  entropy 
model  of  the  plant  is  seen  to  be: 


• A H 

5 = y?  -3Kg  q + w 


1 


(17) 


q = 'r(y^-fy  -3K)g%  + rf(Y?  + w^) 
where  y is  to  be  interpreted  in  accordance  with  (2),  (5)  and  (6). 

Equations  (10)- (12)  clearly  form  the  core  of  the  design  formulation  and  these 
relations  indicate  that  the  projection  x (note:  x = x^  and  rankx  = Nq  are  implied 
by  (12))  plays  a central  role.  With  the  exception  of  the  results  of  Theorem  4 
below,  necessary  and  sufficient  conditions  for  the  existence  of  solutions  to  (10)- 
(12)  remain  the  object  of  research.  It  is  possible,  however,  to  Illustrate  the 
consequences  of  existence,  particularly  with  regard  to  stochastic  stability  and 
the  minimum  property. 

N xN 

To  facilitate  the  statement  of  the  following  result,  denote  by  <xMx^  e ^ ^ 
the  matrix  of  the  map,  M,  restricted  to  the  subspace  upon  which  x is  the  projection 
with  restricted  codomain.  Then  we  have: 


Theorem  2 - Given  N„,  suppose  that  full  rank  g and  P and  positive  s^mi-^efinite 
P,  Q,  T^^x  and  xQx^  exist  satisfying  (10)-(12)  with  (<x%aPx>^,  <x(y^-Qo)x^) 
detectable.  Then: 


A.  System  (17)  under  the  minimum  data/maximum  entropy  model  for  y is  s^econd 
mean  and  almost  surely  exponentially  stable.  Also,  (17)  with  y = y (that 
is,  the  system  with  nominal  values  of  parameters)  is  exponentially  stable. 

B.  The  gains,  (14),  determine  a minimum  of  the  performance  index  (4)  under 
the  constraints  imposed  by  (6)  and  (8) . 

Proof : See  Theorem  2 of  Ref.  [17]. 

Thus,  under  mild  restrictions  the  formulation  of  Theorem  1 not  only  secures 
stability  for  the  nominal  system  but  also  guarantees  a considerable  degree  of 
stochastic  stability.  The  above  results,  particularly  part  B,  now  justify  use 
of  the  term  "optimality  conditions"  in  connection  with  (10) -(12).  Accordingly, 
we  now  devote  the  remainder  of  the  paper  to  discussion  of  various  properties  of 
relations  (10)-(12). 
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4.  QUALITATIVE  FEATURES  RELATIONS  TO  PREVIOUS  RESULTS 


The  principal  requirements  of  Theorem  1 are  seen  to  consist  of  two  modified 
Rlccatl  equations,  (10),  and  two  modified  Lyapunov  equations,  (11).  The  Rlccati- 
like  equations,  (10),  are  each  similar  in  structure  to  the  stochastic  Riccati 
equation  arising  in  the  case  of  full-state  feedback  regulation.  To  recapitulate, 
earlier  work, Refs.  [4,6],  the  time  invariant  gain,  k,  where 

u = - K 5 

which  extremalizes  is  given  by 

K = r^^3^  P (18) 

where  P is  the  positive  semi-definite  solution  to  the  stochastic  Riccati  equation: 
0 = (17  -oP)^  P + P(17  -oP)  + J{P>  + r,  + PoP  (19) 

mm  i 

Now  examine  the  qualitative  features  of  this  equation.  The  modifications  which 
distinguish  (19)  from  the  standard  Riccati  equation  are  the  term  -hi  in  as 
defined  in  (7)  and  the  term  J{P}.  Both  of  these  obviously  arise  from  the  influ- 
ence of  frequency  uncertainty  and  are  responsible  for  important  effects. 

To  better  understand  the  properties  of  (19)  described  in  Ref.  [1],  let  us 
re-write  (19)  in  the  form: 


y - 

0 = + “POP  + 

k=l  k 

where  [A,B]  denotes  the  commutator  (AB  -BA)  and  the  matrices 

il  ; £ = m = k 

n t-n 
0 ; otherwise 


(20) 


(21) 


are  the  "direction  matrices"  for  uncertainties  of  the  form  (5).  The  structure  of 
the  modification  term  in  (20)  suggests  that  for  large  uncertainty  in  the  k*-^  open- 
loop  frequency  (T^^  small),  the  commutator  [P,B]^^]  will  be  suppressed  - and  this  is 
indeed  the  case.  As  noted  in  Refs.  [4,5],  if  all  uncertainty  levels  are  made  to 
increase  without  bound  (all  the  T]^  approach  zero),  then  P approaches  an  asymptotic 
solution  which  commutes  with  all  the  Bj^  - l.e.  , it  is  diagonal  (and  is  given  in 
closed  analytical  form).  Precisely  because  all  the  [P,B]^]  vanish,  a simple 
Lyapunov  argument  shows  that  the  resulting  rate-feedback  control  is  stable  regard- 
less of  the  values  of  modal  frequencies.  In  the  more  general  case  in  which  lower 
order  modes  are  well  known  while  frequency  uncertainty  increases  with  modal  order, 
analogous  results  are  obtained  and  the  third  term  in  (20)  leads  to  robustness 
levels  comparable  to  modelled  uncertainty  levels  and  reduced  computational  burden 
for  the  high  order,  poorly-known  modes.  With  regard  to  the  explicit  appearance 
of  terns  connected  with  uncertainty,  Bqs.  (10)  have  the  same  structure  as  (20)  and 
similar  effects  are  to  be  expected. 
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Now,  previous  developments  proceeded  in  generalizing  results  obtained  under 
the  regulator  problem  by  first  removing  the  assumption  of  full-state  feedback. 
Supposing  state  information  to  be  available  only  through  a finite  number  of  sen- 
sors (with  non-singular  observation  noise).  Ref.  [7]  derived  optimality  conditions 
for  full-order  dynamic  compensation  under  the  maximum  entropy  model.  Let  us  first 
note  that  both  the  conditions  of  Ref.  [7]  and  standard  LQG  theory  can  be  obtained 
as  special  cases  of  the  conditions  given  in  Theorem  1: 

Lemma  1 


A. 


B. 


Under  the  problem  formulation  of  Section  3,  suppose^that  N = 2n.  Then  (10)- 
(12)  are  satisfied  by  t = l2^  = T = g and  by  P,  Q,  P and  Q satisfying: 

0 = Ti^P  + Py"  + I{P  + P}  -PaP  + r 

mm  1 

— A 

0 = y Q + Qy"  + I{Q  + Q}  -QoQ  + v, 
m m X X -c  VC  ^ 


■ — — HA  A — — 

0 = (y  -Qa)  P + P(y  -Qo)  + PaP 

m m 

_ A A — H — 

0 = (y  -aP)Q  + Q(y  -oP)  + QoQ 

m m 


and  the  extremalizing  gains  are  given  by: 

A -l.H„  _ 6 - H -1  /-o/N 

K = K = r2  3P,f  = f = QY  (24) 

If,  in  addition  to  = 2n,  there  is  no  parameter  uncertainty,  i.e*,  J = 0, 
then  the  extremal  gains  are  (24)  but  with  P and  Q satisfying 


0 = y P + Py  -PaP  + r^ 

0 = yQ  + Qy  -QaQ  + v^ 


(25) 


(Thus,  LQG  theory  is  seen  to  be  a very  special  case  in  the  formulation  of 
Theorem  1) . 


Proof : Immediate  from  Theorem  1. 

Of  course,  (22)  and  (23)  are  precisely  the  conditions  first  obtained  in 
Ref.  [7].  Observe  that  in  the  absence  of  the  terms  J{P}  and  J{Q},  (22)  are  of 
the  form  of  the  stochastic  Riccatl  equation,  (19).  In  general,  however,  (22. a) 
and  (22. b)  are  coupled  through  J{P}  and  J{Q}  so  that  the  effects  of  modal  fre- 
quency uncertainty  preclude  the  separation  principle.  Also,  the  coupling  terms 
effectively  augment  the  diagonal  elements  of  r^^  and  Vi , thereby  demanding  greater 
control  authority  and  stabilization.  Moreover,  from  (23) , P and  Q tend  to 

TT  tj 

increase  with  controller  input,  and  its  dual  f^V2f-  We  may  say  that  P and 

Q represent  the  effect  of  error  "leaking  through"  the  regulator  to  the  observer 
and  vice  versa  by  virtue  of  parameter  uncertainty. 

Additional  properties  of  the  full-order  compensation  formulation  together 
with  illustrative  numerical  results  are  presented  in  Ref.  [7].  In  particular, 
the  behavior  of  solutions  to  (22,23)  for  large  uncertainties  is  analogous  to  the 
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regulator  case  and  a closed-form  asymptotic  solution  (which  is  inherently  robust) 
can  again  be  obtained. 


Finally,  Ref.  [16]  removed  the  restriction  Nq  = 2n  to  obtain  the  results 
stated  in  Theorem  1.  Clearly  (10)  and  (11)  are  very  similar  in  structure  to  (22) 
and  (23)  and  exhibit  all  the  features  noted  above.  A fundamentally  new  element, 
however,  is  introduced  by  the  appearance  of  the  projection  t.  Note  particularly, 
that  in  the  present  formulation,  the  compensator  dynamics  matrix  as  well  as  the 
gains  are  determined  by  the  optimality  conditions  (10)-(12).  Furthermore,  these 
conditions  also  determine  the  Nq-dimentional  row  spaces  of  F and  g which  serve  to 
complete  the  definition  of  the  compensator.  Thus,  we  shall  attempt  to  clarify 
the  central  role  played  by  t in  the  following  section. 

5.  GEOMETRIC  INTERPRETATIONS:  THE  OPTIMAL  PROJECTION 


The  geometric  structure  of  the  compensator  of  Theorem  1 is  best  illustrated 
by  the  quasl-f ull-order  compensator  interpretation  given  in  Ref.  [16]  . Specifi- 
cally, if  we  introduce  the  quasi-full-state  estimate, 


A 

5 


A 


H rf.2n 

g q e C 


(26. a) 


A A 

SO  that  = C and; 


(26. b) 


then,  the  closed— loop  system,  (17),  may  be  written: 

\ = pC  -Skt?  + w^ 

t = x(p  -^Y  -6i<)Tt  + t^(y5  + w ) 

m z 

Thus,  although  the  implemented  compensator  has  the  state  q 
may  be  viewed  as  a quasl-f ull-order  compensator  whose  structure 
projection  t.  Clearly,  sensor  Inputs,  f n , are  annihilated  unless  they  are  con- 
tained in  Im(x*')  (the  row  space  of  F),  while  the  quasi-full-order  state  estimate, 
x5,  employed  in  control  input  is  contained  in  Im(x)  (the  row  space  of  g) . In 
consequence,  we  may  call  Im(x)  and  Im(T^)  the  control  and  observation  (respectively) 
subspaces  of  the  compensator.  Complements  of  these  spaces  are,  of  course,  Im(x^) 
and  Im(x^),  where 


To  distinguish  these  subspaces,  we  shall  term  Im  (x^^)  and  Im(x^)  the  "secondary" 
control  and  observation  (respectively)  subspaces. 


I (2  7) 

/q 

= rc  e it 

is  defined  by  the 


As  can  be  seen  from  the  above  discussion,  the  optimality  conditions  of 
Theorem  1 determine  a projection  which,  in  turn,  induces  all  of  the  important 
subspaces  of  the  problem.  For  this  reason,  we  may  call  the  approach  of  Theorem  1 
that  of  "optimal  projection". 

Now  consider  in  what  manner  the  optimality  conditions  (10)-(12)  specifically 
determine  x.  First  note  that  (10)  are  analogous  to  the  corresponding  relations. 
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(22),  determining  P and  Q for  full-order  compensation  and  do  not  constrain  t 
directly.  Next  examine  (11).  For  convenience,  denote  the  quantities  within 
brackets  on  the  right  of  (11. a)  and  (11. b)  by  Ap  and  Aq  respectively.  Then  (11) 
require  that  Ker[x%p]  = Ker[TAq]  = But  since  = Im(T){^Im(Tj^)  = 

Im(x^)  01m(x^) , (11)  gives  rise^to  the  four  conditions: 

0 = x%pX  , 0 = TAqr^  (29. a, b) 

0 = T^ApX^  , 0 = tAqX^  (30. a, b) 

Expanded  out,  (29)  assume  the  form  of  Lyapunov  equations  (very  similar  to  (23)) 
which  serve  to  determine  (x^Px)  and  (x^x^)  but  not  x.  On  the  other  hand,  with 
Ap  and  Aq  dictated  primarily  by  (10)  and  (29),  conditions  (30)  require  that  1) 
the  secondary  control  subspace,  Im(Xj^),  be  Ap-invarlant  and  2)  the  secondary 
observation  subspace,  Im(x^),  be  Aq- invariant.  Thus,  (30)  ^ constrain  x. 

Actually,  (30)  are  the  only  direct  requirements  on  x since  (12)  merely  demand 
that  there  exist  a basis  in  ^2n  (the  "canonical  basis")  for  which  the  matrix  of  x 
assumes  the  projection  canonical  form: 


T* 


A 


0 

0 


(31) 


In  other  words  (see  Lemma  1 of  Ref.  [17]),  there  exists  a non-singular  <p  e 
such  that: 


X 


X*  (p 


(32) 


Thus,  solution  of  (10)- (12)  is  tantamount  to  determination  of  non-singular  (p 
and  positive  semi-definite  P,  Q,  P and  Q with  x defined  by  (31)  and  (32).  Further 
consideration  reveals  that  the  total  number  of  unknown  parameters  to  be  determined 
exceeds  the  number  of  independent  conditions  implicit  in  (10) -(12)  with  (31)  and 
(32)  by  Nq.  This  merely  reflects  the  fact  that  (10)- (12)  can  determine  x only  to 
within  an  arbitrary  transformation  of  the  compensator  state.  In  particular,  we 
may  re-state  Lemma  2 of  Ref.  [17]: 

N xN 

Lemma  2 - Let  u e iS  ^ be  non-singular.  Under  the  transformation: 


r r „ _ H-i 

r = uT  , g = u 

relations  (10)- (12)  retain  their 


8 ; 


I 


N 

q 


form  except  for  the  substitutions: 


(33) 


g g , r r 


(34) 


Moreover  with  (34)  and: 


~ A -1 
q = u q 

/? 

the  closed-loop  system  equations  for  j ^ 

\q 


retain  the  form  of  (17) . 


(35) 
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Thus,  there  is  still  some  latitude  with  regard  to  the  choice  of  t and  we  may 
impose  Nq  additional  conditions  upon  g and  F so  as  to  simplify  their  form.  One 
representation  of  t which  is  particularly  useful  in  applications  is  given  as  fol- 
lows. Assuming  that  there  exist  t,  P and  Q satisfying  (10)-(12),  they  may  be 
written  in  the  forms: 


T 


I -LG 
_H 


[I,  L] 


(36) 


A 

P 


I 


P [I,  L] 


where 


A 

Q 


I -LG' 


H 


Q [I  -GL  , G] 


N x(2n  -N  ) 
G,  L e iS  ^ 

A A N xN 
i Q e e ^ > 0 


Moreover,  the  transformation  (p  of  (32)  is; 


(37) 


(38) 


■ I 

L 

.,-1 

I -LG^  -L^ 

<p  = 

_-G« 

I -g\^ 

; = 

. G«  I. 

(39) 


where  I denotes  either  I or  I.  „ , according  to  context. 

N 2n-N 

q q 

With  (36) , (12)  are  seen  to  be  satisfied  identically.  Furthermore,  the 
explicit  equations  obtained  upon  substitution  of  (36)  and  (37)  into  (10)  and  (11) 
are  given  in  Lemma  3 of  Ref.  [17].  Here  we  note  that  the  relations  corresponding 
to  (29)  consist  of  two  NqxNq  Lyapunov  equations  determining  P and  Equations 
(30)  give  rise  to  two  asymmetric  Riccati  equations  for  L and  G.  As  will  be  illus- 
trated in  the  next  section,  when  | 1l| | and  j |g|  | are  small,  these  equations  may  be 
approximated  by  two  Lyapunov  equations  which,  in  essence,  determine  first  order 
corrections  to  <p  such  that  (32)  is  more  nearly  satisfied. 

6.  ALMOST  IGNORABLE  STATES  - RELATIONSHIPS  TO  MODERN  MODAL  CONTROL 

To  further  Illustrate  the  relations  of  the  present  approach  with  earlier, 
more  conventional  formulations,  we  now  consider  a situation  in  which  control  and 
observation  subspaces  are  well  approximated  (in  some  sense)  by  Nq-dimensional , 
U-invarlant  subspaces.  Specifically,  let  all  system  matrices  be  expressed  in  the 
eigenbasis  of  the  nominal  system  dynamics,  as  in  Section  2 and  suppose  that; 
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Rl  eRj^2 
eRl2  e R^ 


eVi2 

2„ 

eVi2  e V2_ 


'1 

" * 1 /■ 


a ,b 


0 == 

°1 

H 

£012 

2 

, cr  = 

^ ®1 
-H 

£Oi2 

2- 

eOi2 

e 0^ 

e o^ 

c,d 


(40) 


_ N xN 

^1,  Qi,  CTi  e ^ ^ (N^  even) 


where  e ^ 0 and  all  sub-blocks  R^^,  Rj^2>>*-  etc.  are  bounded  and  independent  of  e. 
Clearly,  for  e sufficiently  small,  (40)  Implies  that  the  last  2n-Nq  states  are 
only  slightly  weighted  in  the  performance  index,  weakly  disturbed  and  nearly 
unobservable  and  uncontrollable  - whence  the  term  "almost  ignorable"  may  be 
applied  to  these  states  for  0 < e <<  1. 


First,  we  consider  the  case  e = 0 and  determine  a solution  to  (10)- (12) 
which  corresponds  to  the  familiar  "modern  modal  control"  approach  wherein  the 
last  2n-Nq  states  are  dropped  and  a full-order  compensator  is  designed  for  the 
reduced-order  model.  For  e = 0,  the  main  result  is: 


Theorem  3 - Assume  conditions  (40)  with  e = 0.  Partitioning  all  matrices  in 
accordance  with  (40) , suppose  that  P2  is  asymptotically  stable  (in  accordance 
with  the  assumptions  of  Section  2)  and: 

— h — h 

(Pf,  cr^  ) , (y^,  ) controllable 

(Rq  , Pq)  , (Oq  , Pq)  r e c o o s t r u c t 1 b 1 e 


Then  the  fixed-order  compensator  optimality  conditions  (10) -(12)  are  satisfied  by 


T = 

r I 

N 

q 

0 

0 

0 

P = 

0 

, Q = 

o' 

_o 

0 - 

_0 

0- 

A 

o' 

A 

4 

Q° 

0 ' 

P = 

. 0 

0. 

, Q = 

0 

0 - 

a,b 


c , d 


o o 

where  Pq,  Qq,  P and  Q are  the  unique  positive  definite  solutions  to: 
0 = ^1^^  + ^?ml  + ^l^P?  + + Rl 


(42) 


(43) 


(44) 
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A A 

/ — r^O — iJO  , “O  / — „0 — N , TiO  -nO 


^0.^0 


Proof : See  Theorem  3 of  Ref.  [17]. 


I (45) 


Conditions  (44)  and  (45)  are  simply  the  full-order  compensation  optimality 
conditions  (22)-(23)  for  the  reduced  order  model  comprising  the  first  states. 
Since  Nq  is  even,  t also  assumes  the  canonical  form  (31)  in  the  usual  modal  coor- 
dinate basis.  Thus  the  modern  modal  control  approach  (which  proceeds  by  modal 
truncation)  amounts  to  the  assumption  that  (42)  is  a decent  approximation  for  x. 


For  e > 0,  this  simple  state  of  affairs  no  longer  prevails.  However,  it 
should  be  expected  that  solutions  of  (10)-(12)  "in  the  vicinity"  of  (42)  and  (43) 
do  exist  provided  e is  sufficiently  small.  Under  modest  restrictions,  this  is 
the  case  - as  the  following  emphasizes: 


Theorem  4 - Suppose  that  e > 0,  but  otherwise  retain  the  conditions  of  Theorem  3. 
Expressing  x,  P and  Q according  to  (36)  and  (37),  consider  the  formal  series 
expansions : 


and 


k=l 


k J(k) 

e P 


G = 


00 


L 


o'*  G<‘'> 


(46) 

(47) 


with  similar  expressions  for  Q,  P and  Q.  Here,  the  leading  terms  for  P,  Q,  P and 
Q are  given  by  (43)- (45),  and  all  remaining  terms  are  obtained  by  regular  pertur- 
bation expansion  applied  to  (10)  and  (11) . 


Under  these  conditions,  there  exists  an  e > 0 such  that  for  all,  e ^ e,  the 
series  (46)-(47)  are  absolutely  convergent  to  solutions  of  (10)- (12). 

Proof : See  Lemmas  4 and  5 and  Theorem  4 of  Ref.  [17]. 


This  result  confirms  the  expectation  that  the  problem  is  well  posed  in  some 
neighborhood  of  e = 0.  Of  course,  the  radius  of  convergence  of  (46)- (47)  remains 
to  be  determined.  However,  this  question  will  be  deferred  to  a subsequent  report 
since  our  main  interest  here  is  the  qualitative  behavior  of  solutions  for  e suf- 
ficiently small  but  non-zero. 

7.  SMALL  e-SOLUTIONS  - SPILLOVER  ACCOMMODATION  THROUGH  OPTIMAL  PROJECTION 

Supposing  that  e is  sufficiently  small  in  the  case  of  almost  ignorable  states, 
the  last  theorem  above  guarantees  existence  of  solutions  to  the  optimality  condi- 
tions of  Theorem  1 and  provides  a perturbation  expansion  which  has  the  modern 
modal  control  approximation  of  Theorem  3 as  its  point  of  departure.  Although 
more  general  existence  results  and  more  satisfactory  computational  methods  will 
be  presented  subsequently,  let  us  persue  the  developments  of  the  last  section  in 
order  to  illustrate  the  manner  in  which  the  present  formulation  modifies  the 
usual  modern  modal  control  approach. 
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To  simplify  the  algebra  somewhat  in  developing  explicit  expressions,  con- 
sider only  the  case  of  a deterministic  plant: 

I = 0 , y = y (48) 

m 

and  suppose  the  primary  and  secondary  modes  to  be  independently  weighted  and  dis- 
turbed: 


V 


12 


= R. 


12 


= 0 


(49) 


With  these  restrictions,  the  results  of  Theorem  3 would  apply  were  it  not 
for  the  non-zero  values  of  012  and  0]^2  (40.c,d).  These  terms  arise  from  the 

controllability  and  observability  of  the  almost  ignorable  states. 


To  illustrate  the  effects  of  non-zero  a^2»  us  develop  the  series 

solutions  of  Theorem  4 up  to  and  including  terms  of  order  e^.  The  results  (see 
Corollary  1 of  Ref.  [17])  may  be  summarized  as  follows:  P > 0,  Q > 0 and  x sat- 

isfying (10)- (12)  may  be  approximated  by: 


where  P, 


P 


X 


P + O(e^)  , 

-LG^l 


Q = Q + O(e^) 
L]  + O(e^) 


Q,  L and  G are  determined  by: 


a,b 


(50) 


y P + Py  -Pa  P + R 


— H '^o^ 
yQ  + Qy  -Qa  Q 
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■ 0 

0 
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, V = V,  + 

_0 

’ 1 

.0 

G^Q°aiQ°G 

a,b  (53) 


and  where  and  a°  denote  a and  a>  respectively,  with  e = 0 in  (40.c,d)  while 
P°,  Q^,  P°  and  are  determined  according  to  (44)  and  (45)  with  = 0. 

(50)- (53)  now  give  an  explicit,  approximate  solution  of  (10)-(12)  for  suf- 
ficiently small  e which  incurs  errors  of  order  e^.  For  e > 0 and,  in  contrast  to 
the  modern  modal  control  approximation  of  Theorem  3,  (50. c)  and  (52)  show  that 
the  optimal  projection  is  not  in  the  canonical  form  in  the  modal  coordinate  basis. 
Rather,  the  vectors  spanning  the  comgensator  observation  and  control  subspaces 
(the  row  vectors  of  [I,L]  and  [I  -GL  ,G] , respectively)  have  small  (0(e))  compo- 
nents along  the  almost  ignorable  states,  represented  by  L and  G.  As  (52)  shows, 
the  magnitudes  ^f  these  are  dictated  by  the  control  and  observation  spillover 
terms,  0^2  and  respectively.  Thus,  some  accommodation  is  made  for  spillover 

effects  by  "rotating"  the  compensator  subspaces  in  the  direction  of  the  almost 
ignorable  states.  The  manner  in  which  this  directly  enhances  stabilization  for 
the  last  2n-Nq  states  is  described  more  fully  elsewhere. 
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A second  major  qualitative  feature  can  be  seen  from  (51)  and  (53).  It  is 
evident  that  the  present  optimality  conditions  effectively  augment  the  state 
weighting  and  disturbance  noise  intensity  for  the  almost  ignorable  states.  This 
augmentation  occurs  through  the  terms  L^pOo^^P^L  and  G^Q^o^Q^G  appearing  in  (53)  . 

In  consequence,  conditions  (10)- (12)  automatically  impose  additional  control 
authority  on  the  almost  ignorable  states  in  direct  proportion  to  the  magnitudes 
of  observation  and  control  spillover  effects. 

Thus,  in  summary,  rather  than  demanding  the  suppression  of  spillover,  the 
optimality  conditions  (10)- (12)  automatically  accommodate  for  spillover  effects 
so  as  to  achieve  overall  closed-loop  stability. 

8.  CONCLUDING  REMARKS 

The  approach,  presented  here,  for  design  of  mean-square  optimal,  fixed-order 
dynamic  compensation  is  seen  to  possess  the  following  salient  aspects; 

(1)  The  formulation  is  predicated  upon  a maximum  entropy  stochastic  model 
of  the  plant  which  not  only  recognizes  the  design  impact  of  structural  parameter 
uncertainties  but  also  Incorporates  a minimum  of  a priori  statistical  informa- 
tion. Such  a maximally  unpresumptlve  stochastic  modelling  approach  provides  an 
implicit  mechcinism  for  the  attainment  of  robust  stability. 

(2)  The  approach  eschews  both  ad  hoc  model  reduction  schemes  and  the  purely 
computational  parameter  optimization  technique  and  is  based  upon  explicit  opti- 
mality conditions  presented  in  Theorem  1.  Such  conditions  reveal  that  the 
"optimal  projection”  defining  the  compensator  structure  plays  a crucial  role  and 
provides  significant  theoretical  insight.  Moreover,  under  mild  restrictions, 
existence  of  solutions  to  the  optimality  conditions  guarantees  closed-loop 
stochastic  stability  under  the  maximum  entropy  design  model. 

Furthermore,  we  attempted  in  Sections  4 and  5 to  provide  insight  into  the 
significance  of  the  fixed-order  compensation  optimality  conditions  and  their 
relation  to  earlier,  more  restricted  formulations.  In  particular,  the  last  two 
sections  above  presented  explicit  results  revealing  the  familiar  modern  modal 
control  approach  as  a special  approximation  under  the  present  formulation.  Most 
importantly,  it  was  seen  that  the  optimality  conditions  of  Theorem  1 can  secure 
global  stability  without  necessitating  suppression  of  observation  and  control 
spillover. 

Clearly,  the  above  features  tend  to  remedy  many  of  the  deficiencies  of  cur- 
rent design  methods.  Computational  techniques  and  illustrative  numerical  results 
will  be  given  in  a subsequent  publication.  Finally,  just  as  in  the  case  of  full- 
order  compensation  considered  previously,  we  may  anticipate  that  the  optimality 
conditions  (10)- (12)  possess  simple  as3miptotic  solutions  associated  with  the 
high-order,  poorly-known  modes.  This  feature  tends  to  reduce  the  burden  of  design 
computation  and  permits  the  use  of  very  high  order  design  models. 
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ABSTRACT 

Large  space  structures,  or  any  mechanically  flexible  structures,  are  in- 
herently distributed  parameter  systems  (DPS)  whose  dynamics  are  modeled  by 
partial,  rather  than  ordinary,  differential  equations.  Such  DPS  are  described 
by  operator  equations  on  an  infinite-dimensional  Hilbert  (or  Banach)  space. 
Hov^ever,  any  feedback  controller  for  such  a DPS  must  be  a finite-dimensional 
(and  discrete-time)  system  in  order  to  be  implemented  with  on-line  digital 
computers  and  a finite  (small)  number  of  actuators  and  sensors.  There  are  many 
ways  to  synthesize  such  controllers;  we  will  emphasize  the  Galerkin  or  finite- 
element  approach. 

Although  the  overall  performance  of  finite-dimensional  controllers  is 
important,  the  first  consideration  is  their  stability  in  closed-loop  with  the 
actual  DPS.  The  analysis  of  DPS  makes  use  of  the  theory  of  semigroups  on  the 
infinite-dimensional  state  space.  We  will  present  stability  bounds  in  both  the 
time  and  frequency  domains  for  infinite-dimensional  systems.  Presently,  the 
frequency-domain  approach  appears  to  yield  more  easily  tested  stability  con- 
ditions than  the  time-domain  approach;  however,  we  will  show  some  relationships 
between  these  two  methods  and  emphasize  the  role  played  by  the  DPS  semigroup 
and  its  properties.  It  seems  to  us,  that  such  stability  conditions  are  essential 
for  the  planning  and  successful  operation  of  complex  systems  like  large  aerospace 
structures . 


1 . 0 INTRODUCTION 

Almost  every  engineering  system  will  exhibit  some  distributed  parameter 
behavior  if  one  looks  at  its  dynamics  in  great  detail.  Consequently,  the  dynami- 
cal behavior  of  such  a distributed  parameter  system  (DPS)  would  need  to  be  modeled 
by  partial,  rather  than  ordinary,  differential  equations  to  be  correctly  repre- 
sented. Of  course,  in  many  cases,  such  detail  is  not  necessary  for  the  successful 
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operation  of  the  system,  and  a lumped  parameter  (ordinary  differential  equation) 
model  is  satisfactory.  Nevertheless,  a large  number  of  current  and  newly  pro- 
posed systems,  such  as  industrial  processes  and  mechanically  flexible  spacecraft 
and  satellites,  are  so  thoroughly  distributed  parameter  in  nature  that  it  is 
impossible  to  ignore  this  in  modeling  and  control. 

There  are  many  new  problems  (and  some  compounding  of  old  ones)  in  feedback 
control  of  DPS.  The  main  questions  which  concern  us  are  (1)  how  to  synthesize 
finite-dimensional  controllers  which  can  be  implemented  by  a small  number  of 
actuators  and  sensors  and  on-line  computers  of  limited  word-length;  (2)  how  to 
assess  the  stability  and  performance  of  these  controllers  in  closed-loop  with 
the  actual  DPS  which  is  infinite-dimensional.  Presently,  a great  deal  of  thought 
has  gone  into  DPS  control  because  of  the  proposed  construction  and  operation  on- 
orbit  of  large  flexible  spacecraft  and  satellites  in  the  near  future.  Quite  a 
large  number  of  control  schemes  have  been  considered  for  these  large  space 
structures  (e. g.  see  the  survey  [1])  and  new  ones  (or  major  revisions  of  the  old 
ones)  are  being  developed  almost  daily.  This  kind  of  activity  has  been  helpful,  we 
believe,  because  it  has  called  the  control  systems  community's  attention  to 
systems  whose  high  performance  and  reliability  is  essential  for  successful 
operation  and  has  focused  it  on  some  of  the  important  "gaps"  between  control 
theory  and  engineering  practice.  To  be  sure  other  applications,  e.g.  Tokomak 
fusion  reactors  and  large-scale  electric  power  distribution  networks,  are  having 
a similar  effect. 

In  past  work,  we  have  been  strongly  in  favor  of  a more  practical  theory 
of  DPS  control  which  incorporates  some  of  the  natural  system  constraints  and  uses 
ideas  in  infinite-dimensions  that  appeal  to  the  intuition  and  experience  of  the 
modern  control  engineer.  We  have  taken  some  (although  not  necessarily  the  only) 
definitive  steps  in  that  direction  [2].  In  this  paper,  we  intend  to  continue  in 
that  spirit  and  focus  on  the  stability  question  of  finite-dimensional  controllers 
in  infinite-dimensional  DPS.  Our  analysis  here  will  emphasize  stability  results 
in  both  the  time  and  frequency  domains  and  some  of  the  relationships  between 
these  results.  The  unifying  concepts  of  operator  equations  on  Hilbert  spaces 
and  the  G^-semigroup  for  DPS  will  be  stressed;  however,  the  resulting  stability 

tests  will  only  involve  data  which,  it  is  reasonable  to  assume,  would  be  available 
or  could  be  estimated. 

This  infinite-dimensional  approach  can  be  used  effectively  in  large-scale 
finite-dimensional  systems,  as  well.  One  very  Important  consideration  in  large- 
scale  or  DPS  is  to  avoid  a dependence  on  precise  knowledge  of  (a)  the  total  system 
dimension  and  (b)  the  full  system  parameters,  especially  those  residual  parameters 
not  used  in  the  synthesis  of  the  controller.  Using  an  infinite-dimensional 
approach  obviates  (a) , and  for  (b)  we  shall  only  need  to  be  able  to  estimate 
the  norms  of  certain  of  the  residual  data. 

In  previous  work,  we  have  emphasized  the  internal  or  time-domain  view- 
point of  exponential  stability  of  the  closed-loop  system,  i.e.  the  effect  of 
impulsive  disturbances  on  the  state  decay  exponentially  to  zero.  However, 
another  viewpoint  is  the  external  or  frequency-domain  view  which  describes  the 
effect  of  persistent  disturbances  on  the  system  outputs.  Here  our  main  results 
in  the  frequency  domain  give  conditions  undej  which  th^  closed-loop  system  is 
bounded  input-bounded  output  stable  in  the  L sense  (L  -BIBO  stable) . 
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2 2 
This  means  any  persistent  L -disturbance,  which  is  bounded,  can  only  produce  L - 

outputs  which  are  also  bounded.  While  the  former  looks  at  the  DPS  from  a state- 

space  viewpoint,  the  latter  looks  at  it  from  an  input-output  standpoint.  Both 

approaches  have  their  strengths  and  weaknesses,  and  they  are  interrelated,  as  we 

shall  point  out  later. 

In  lumped  parameter  systems,  the  BIBO  approach  is  very  popular  (e.g.  see 
[3]- [7])  where  it  appears  to  lead  to  easily  verified  stability  tests.  For 
infinite-dimensional  systems,  see  [8]-[9].  These  ideas  were  applied  to  modal 
control  of  large  space  structures  in  [10].  There  is  also  some  use  of  positivity 
concepts  for  structures  [11];  these  yield  special  conditions  under  which  the 
closed-loop  system  is  "dissipative"  and  hence  BIBO  stable.  Much  is  made  of  the 
"robustness"  of  the  BIBO  designs,  i.e.  their  placidity  in  the  presence  of 
uncertain  system  parameters.  However,  it  should  be  pointed  out  that  the  BIBO 
approach  involves  one  of  several  very  useful  perturbation  methods  and  has  no 
intrinsic  robustness  properties;  robustness  under  uncertainty  can  be  assessed 
with  all  perturbation  techniques  with  varying  degrees  of  difficulty.  The 
relative  ease  accomplished  with  BIBO  methods  is  what  recommends  them  in  lumped 
parameter  systems  and  suggests  their  utility  for  DPS,  as  well. 

In  Sec.  2.0,  the  general  DPS  mathematical  framework  of  this  paper  is 
presented.  Basic  ideas  of  reduced-order  modeling  and  control  appear  in  Sec.  3.0 
and  the  concepts  of  stability  for  DPS  appear  in  Sec.  4.0.  Our  main  results  on 
time-domain  (exponential)  stability  tests  for  the  closed-loop  appear  in  Sec.  5.0 
and  the  new  results  on  frequency-domain  (BIBO)  stability  are  in  Sec.  6.0.  Our 
conclusions  form  Sec.  7.0. 


2.0  DPS  DESCRIPTIONS 

We  will  consider  linear  distributed  parameter  systems  (DPS)  with  the 
following  form: 


' = Av(t)  + Bf(t)  + Tfj^(t) 

. y(t)  = Cv(t);  v(0)=v^ 


(2.1) 


The  system  state  v(t)  will  lie  in  a Hilbert  space  H with  inner  product  denoted 
by  (•,*)  and  corresponding  norm  denoted  by  i|*||-  The  operator  A is  an  unbounded, 
closed,  t ime- invar ian t , linear  differential  operator  whose  domain  D(A)  is  dense 
in  II;  furthermore,  A generates  a C^-|^|migroup  U(t).  This  semigroup  is  the 
analogue  of  the  matrix  exponential  e for  finite-dimensional  systems.  The  input 
operators  B and  C are  both  finite  rank  M and  P respectively.  The  external  dis- 
turbances on  the  system  are  represented  by  the  persistent  term  Ffj^(t)  , where  F 

has  rank  M^,  and  the  impulsive  term  v^.  This  is  becoming  a standard  representation 

for  many  DPS  applications  involving  interior  control  and  measurement. 
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From  the  Hille-Yoshlda  Theorem  (e.g.  [12J  or  [13]),  the  operator  A 
generates  a C^-semlgroup  U(t)  satisfying: 


I lu(t)|  1 < Ke 


(2.2) 


where  K > 1 and  a is  real,  when 


I 1(R(A,A))’^|  1 < .J.  ; n=l,2,...  (2.3) 
(X-KJ)'' 

for  all  real  X>-a  in  the  resolvent  set  of  A.  The  operator  R(X,A)=(Xl-A)  is 
the  resolvent  operator  for  A;  by  definition,  it  is  a bounded  linear  operator 
for  each  X in  the  resolvent  set  of  A.  In  the  special  case,  where  A is  dissi- 
pative, i . e . 


( Av , v) +a  ( V , v)_<0 
(A*v,v)+0(v,v)_<O 


(2.4) 

1 


for  all  V in  D(A)  or  D(A*)  where  A*  is  the  adjoint  of  A,  the  constant  K=1  in 
(2.2);  see  [13]  Theo.  3.2. 


When  a>0  in  (2.2),  A generates  an  exponentially  stable  semigroup  U(t) 
whose  stability  margin  is  a.  The  pair  of  operators  (A,B)  is  exponentially 

M 

stabilizable  when  there  is  a (bounded)  linear  gain  operator  G:H-^R  such  that 
the  operator  A+BG  generates  an  exponentially  stable  semigroup.  There  are  other 
types  of  mathematical  stability  and  stabilizabllity  defined  for  DPS;  some  of 
these  ideas  will  be  discussed  in  Sec.  4.0. 


When  (A,B)  and  (A*,C*)  are  exponentially  stabilizable,  and  there  are  no 
persistent  disturbances  (l.e.,  fjj(t)=0),  it  has  been  shown  in  [2]  that  the 

infinite-dimensional  controller: 


f f(t)  = G ^(t) 


■< 


dv(t) 

3t 


= L v(t)  + K y(t) 


[ v(o)  = 0 


(2.5) 


where  L = A+BG  - KC  with  domain  D(L)  = D(A) , exists,  and  , in  closed- loop  with 
(2.1),  it  produces  an  exponentially  stable  system.  The  gain  operators  G & K 
exist  and  stabilize  the  operators  A+BG  and  A-KC.  This  controller  is  mathemati- 
cally interesting  but  not  very  useful  from  an  engineering  standpoint  since  (2.5) 
cannot  be  implemented  with  a finite-dimensional  system.  In  the  next  section, 
the  concepts  of  model  reduction  and  reduced-order  control  are  reviewed  from  [2] . 


Before  closing  this  section,  we  want  to  point  out  that  without  any  loss 
of  generality  we  can  and  will  assume  in  (2.1): 


r = B 


(2.6) 


(2.7) 


This  is  possible  because  F(t)=Bf (t)+rf^(t)  can  be  rewritten: 


F(t)  = B(f(t)+f^(t)) 


^M 

■ 0 - 

where  B=[Br],  f(t)= 

f(t),  and  fp(t)= 

I 

.0  _ 

L 

fjj(t) . Therefore,  we  will 


assume  (2.6)  and,  henceforth,  (2.1)  will  be  given  by: 

= A v(t)  + B(f(t)+f„(t)) 

(2.8) 

. y(t)  = C v(t) ; v(o)  == 

where  now  the  rank  M of  B reflects  the  total  number  of  actuators  and  disturbance 
inputs . This  representation  will  be  very  convenient  later. 


3.0  MODEL  REDUCTION  AND  FINITE-DIMENSIONAL  DPS  CONTROL 


In  order  to  produce  finite-dimensional  controllers  for  the  DPS  (2.8),  we 
must  make  a lumped  parameter  approximation  of  it.  This  is  done  when  numerical 
methods  such  as  finite  elements  or  finite  differences  are  used  to  discretize  the 
spacial  variables.  In  general,  such  an  approximation  or  reduced-order  model  (ROM) 
is  a (not  necessarily  orthogonal)  projection  of  (2.8)  onto  an  appropriate  finite- 
dimensional subspace  of  H;  usually,  we  will  assume  D(A).  The  ROM  sub- 
space has  dimension  N and  its  projection  is  denoted  by  P^;  the  residual  sub- 
space associated  with  completes  the  decomposition  and  its 

projection  is  denoted  by  P^.  The  total  DPS  state  v can  be  written: 


V = 


V +v 
N R 


(3.1) 


where  v =P„v  and  v„=P_v.  The  choice  of  the  subspaces  and  H_  is  usually 
NNRR  NR 

dictated  by  the  physical  application  and/or  the  numerical  procedures  available 

for  integrating  the  DPS.  When  feedback  control  is  the  ultimate  purpose  of  the 

model  reduction,  certain  choices  of  subspaces  will  yield  advantages. 


A modal  subspace  consists  of  linear  combinations  of  a finite  number  of 
modes  or  eigenfunctions  of  the  operator  A.  Modal  subspaces  have  very  special 
properties  in  control  applications,  e.g.,  and  in  (3.2)  later. 

However,  since  most  engineering  applications  are  too  complex  for  the  exact  modes 
to  be  known,  these  subspaces  are  more  conceptually,  rather  than  practically, 
useful.  The  Galerkin  (or  finite  element)  method  is  the  most  often  used  approach. 

The  projection  of  the  DPS  (2.8)  onto  the  subspaces  and  decomposes 
the  system  into  the  following: 
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(3.2a) 


9t 


^ y = SVS'^R 


(3.2b) 

(3.2c) 


where  Aj^=P^APj^,  etc.  The  terms  and  A^v^  are  called  modeling 

error  and  the  terms  Bj^(f+f^)  and  are  called  control  and  observation  spill- 

over, respectively  [2].  The  reduced-order  model  (ROM)  is  obtained  from  (3.2)  by 
ignoring  the  residuals : 


sr  - 

y - V 


in  any  choice  of  model  reduction  scheme  it  makes  no  sense  if  the  residuals  are 
unstable;  therefore,  we  will  assume  that  generates  a C^-semigroup  U^^(t)  with 

the  property; 


U^(t) 


-V 


t>0 


(2.4) 


with  K >1  and  CTp>0.  Such  a condition  is  naturally  satisfied  in  practice,  as 
long  as  one  is  careful  of  the  selection  of  and  in  theory,  one  would  obtain 
and  from  the  Hille-Yosida  or  dlssipativlty  tests  (2. 3)- (2. 4). 

In  order  to  control  the  DPS  (2.8),  a finite-dimensional  controller  is 
generated  from  the  ROM  (3.3): 


^ = Vn 


3v 


N 


9t  ” 

y = Vn’ 


(3.5) 


This  is  the  most  obvious  candidate  for  a feedback  controller;  however,  there  are 
many  ways  in  which  (3.5)  can  be  modified  and  improved,  as  pointed  out  in  [2]. 
Nonetheless,  (3.5)  is  a good  starting  point  for  the  controller  synthesis;  it  is, 
in  fact,  what  most  control  system  designers  do  with  both  large-scale  and  dis- 
tributed parameter  systems . The  conditions  under  which  a reasonable  controller 
design  can  take  place  are  essentially  that  (A^,Bj^,Cj^)  be  controllable  and  obser- 
vable in  the  finite-dimensional  sense.  Such  conditions  indicate  the  minimum 
number  of  control  devices  necessary  for  the  task  and  also  the  possible  locations 
for  these  devices. 
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Another  approach  is  to  obtain  a finite-dimensional  controller  by  performing 
a model  reduction  on  the  infinite-dimensional  controller  (2.5).  But  this  yields  [2] 
a controller  whose  form  is  the  same  as  (3.5);  hence,  our  later  analysis  will  be 
unchanged. 


4.0  STABILITY  AND  STABILIZABILITY  CONCEPTS  FOR  DPS 


In  infinite-dimensional  spaces  there  are  many  ways  to  define  the  concepts 
of  stability.  In  this  section  we  shall  emphasiz^  the  two  most  useful  ones  for 
engineering  systems:  exponential  stability  and  L -BIBO. 


4.1  Exponential  Stability 


The  linear  DPS  (2.8)  is  said  to  be  exponentially  stabilizable  when  there 
exists  a full-state  feedback  law: 

f(t)  = G v(t)  (4.1) 

M 

with  G:H->-R  a linear  finite-rank  gain  operator  such  that  (2.8)  becomes: 


8v(t) 

9t 


A^v(t)+Bfj^(t) 


y(t)  = C v(t);  v(o)=v^ 


with  A.^=A+BG  generating  a C^-semigroup 

Mu  (t)  11  < K e ° , t>0 
' ' o — o — 


(4.2) 

U^(t)  which  is  exponentially  stable,  i.e. 

(4.3) 


with  constants  K,^>1  and  <^^>0.  This  is  a time-domain  or  state-space  concept 

whd.ch  says  that,  in  the  absence  of  any  persistent  disturbances  (i.e.  fj^(t)SO) 

the  effects  of  all  impulsive  disturbances  decay  to  zero  exponentially,  i.e. 

-a  t 


1 v(t)  1 < K e ° 0 as  t->«> 

' — o 


(4.4) 


The  rate  of  convergence  or  stability  margin  is  in  (4.3). 

There  are  other  types  of  stabilizability  for  (2.8);  these  depend  on  the 
different  types  of  convergence  possible  in  infinite-dimensional  spaces.  For 
example,  (2.8)  is  strongly  stabilizable  when  (4.1)  yields  an  A^  which  generates 

U^(t)  such  that  v(t)  converges  strongly  to  zero  (when  fj^(t)=0),  i.e. 

lim  Mu  (t)v  1 1 = 0 (4.5) 

t->oo 

for  all  v^  in  H.  Alternatively,  (2.8)  is  weakly  stabilizable  when  (4.1)  causes 
v(t)  to  conveirge  weakly  to  zero,  i.e. 

11m  (U  (t)v  ,v)  = 0 (4.6) 

t->oo  ° 
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for  all  and  v In  H.  In  these  last  two  types  of  mathematical  stabilizabillty, 

no  rate  of  convergence  is  available;  consequently,  they  are  not  of  much  practical 
interest  for  engineering  systems.  Therefore,  we  will  emphasize  the  exponential 
stability  of  (4.3)  with  its  stability  margin  which  becomes  essential,  as  we 

shall  see  later.  When  H is  finite-dimensional,  all  of  the  above  stability  con- 
cepts are  equivalent. 

Note  that  the  feedback  law  (4.1)  is  only  of  theoretical  interest.  It 
cannot  be  implemented  in  practice  since  it  requires  instantaneous  knowledge  of 
the  infinite-dimensional  state  v(t) ; except  for  a few  very  special  gain  operators 
G,  this  is  never  practical.  The  actual  control  law  must  be  generated  by  a finite- 
dimensional  system  (3.5);  for  this  we  need  the  concept  of  stabilizing  subspaces. 

We  will  say  (A,B)  in  (2.8)  has  a pair  of  stabilizing  subspaces  and 
if,  in  addition  to  the  model  reduction  requirements  of  Sec.  3.0,  they  also  satisfy: 

A =A+BG  exponentially  stable  with  a desired  stability  margin  0 for  some 
M 

G:H-^R  such  that: 


(4.7) 


Note  that  (4.7)  says  that  G =GP  =0,  l.e.,  A is  stabilized  by  a gain  that  is 

restricted  to  the  finite-dimensional  subspace  H^.  This  situation  often  occurs 

when  modal  reducing  subspaces  are  used  in  "parabolic"  or  "hyperbolic"  problems, 

but  it  can  also  occur  in  other  model  reduction  schemes  for  DPS . It  means  that 

all  but  a finite  number  of  the  states  of  the  system  have  the  desired  stability 

margin  a and  those  that  do  not  may  be  stabilized  by  the  gains  (4.7).  Further- 
o 

more,  using  (4.7),  we  obtain: 


V"nS 


A = 
o 


.^RN-^^R^N 


\r 


(4.8) 


From  (4.8),  it  is  easy  to  see  that,  if  (Aj^,Bj^)  is  controllable  and  | |Aj^^|  | is 
small,  then  the  model  reduction  subspaces  K,  and  are  stabilizing  subspaces. 

W Iv 

Furthermore,  it  seems  highly  unlikely  that  a finite-dimensional  controller  (3.5) 
can  ever  stabilize  (exponentially)  an  infinite-dimensional  DPS  without  the 
existence  of  stabilizing  subspaces. 


The  stabilizing  subspace  idea  was  used  in  [14]  to  unify  stability  and 
controller  synthesis  for  discrete  and  continuous- time  DPS.  As  we  shall  see, 
it  is  also  useful  for  Interrelating  time  and  frequency  domain  stability  bounds. 

2 

4 . 2 L -Bounded  Input  Bounded  Output  Stability 

In  the  previous  subsection,  stability  is  treated  as  an  internal  or  state  space 
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property  of  the  DPS.  Alternatively,  the  stability  of  (4.2)  may  be  looked  at  from 
an  external  or  input-output  viewpoint,  i.e.  what  is  the  behavior  of  the  output 
y(t)  for  a given  disturbance  input  fj^(t).  The  concepts  of  L^-bounded  input, 

bounded  output  (L^-BIBO)  stability  have  been  used  for  both  lumped  and  distributed 
parameter  systems;  see  e.g.  [7]  Chapt.  5,  [8],  [9],  or  [15]  Chapt.  6.  Following 
[9],  we  shall  say  (4.2)  is  L^-BIBO  stable  for  Kp^®®  when,  for  we  have 

(a)  if  fp(*)  is  in  L^(R,R^^),  then  y(*)  is  in  L^(R,R^)  and  (b)  there  exists  a 

constant  M <o°  such  that 


|y<*) i Ip  1 \ 


(4.9) 


where,  for  any  positive  integer  q,  (R,R^)={g( • ) | g:R->R*^  and  | |g(t)  j pdt<«>} 

00  « 

- —00 

with  |ig(*)ll  =(,  I |g(t)  I l^dt)^  ^ for  l<p<o°  for  L (R,R*^)E{g(  • ) ] g:  R-^R*^  and 


sup  ||g(t)||  <“}  with  ||g(*)lloo  = sup  ||g(t)ll.  These  spaces 
teR  t£R 

with  the  given  norms  | [ • [ | and  the  R*^  norm  taken  to  be  | |g(t)  | | 
2 ^ 

when  p=2,  L is  a Hilbert  space. 


are  Banach 
= (g^(t)g(t))^^^ 


In  much  of  the  finite-dimensional  control  literature,  bounded  input, 
bounded  output  stability  is  a fundamental  concept;  see  e.g.  [16]  Chapt.  4.  The 
version  that  is  often  seen  is  L°°-BIB0  stability  where  a bounded  disturbance  pro- 
duces a bounded  output  when  the  initial  condition  is  zero.  It  is  shown  in  [16], 
Theo.  3 p 197,  that  exponential  stability  of  implies  l“-BIB0  stability;  the 

proof  is  for  finite-dimensions  but  it  extends  directly  to  infinite-dimensions. 

In  fact,  for  finite-dimensions,  the  two  concepts  are  equivalent  when  (A,B,C)  is 
controllable  and  observable. 

2 

However,  in  more  recent  literature,  e.g.  [3]-[ll],  L -BIBO  stability  is 
much  more  popular.  The  main  reason  for  this  is  the  Fourier-Plancherel  Theorem 
which  says  that  the  Fourier  transform; 


g(w) 


g(t)  e ^**^^dt 


is  a unitary  operator  on  L , i.e. 


(4.10) 


I |g(w)  1 12  ==  I is(t)  i I2 


(4.11) 


where  the  integral  in  (4.10)  is  taken  in  the  sense  of  "limit  in  the  mean",  i.e. 
it  is  taken  over  the  bounded  Interval  [-N,N]  and  converges  to  (4.10)  in  the 
2 

L -norm  as  The  proof  of  this  result  (sometimes  known  as  Parseval's  Thorem) 

is  given  in  [17]  p 259  or  [7]  App  B.2.  Furthermore,  the  inverse  Fourier  trans- 
form is  given  by: 
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g(t)  = ^ g(a))e^“*^dO)  (4.12) 

-.00 

This  provides  a way  to  transform  from  time  to  frequency  domain  and  return  without 
2 

changing  the  L -norm;  hence,  we  have  the  following  result: 

Theorem  1:  The  DPS  (4.2)  is  L^-BIBO  stable  if 

sup  1|T  (w)||  f M < <»  (4.13) 

-oo<(iO<<» 

where 

= T^(w)  !jj(w)  (4.14) 

with  Tq(«)  the  P X M transfer  matrix  for  (4.2)  and  is  the  same  constant  as  in 
(4.9)  when  p=2. 

2 M 

Proof  I Let  fjj(*)  be  in  L (R,R  ).  Consider,  from  (4.11)  and  (4.14),  that 

00  00 

I|y(-)ll2  = I|y(*)ll2  = j da)  < I |lT^(a))||2  | |fj^(03)  | | ^do)  (4.15) 

— OO  —00 

Use  (4.13)  in  (4.15)  to  obtain: 

OO  00 

l|y(*)|l2lM2  I ||yco)||2da3  = j ||fj^(a))||2  = ||lp(.)ll2 

—OO  00 

and  the  desired  result  follows.  # 

2 

Therefore,  the  sufficient  condition  (4.13)  for  L -BIBO  stability  is 

I I 1 5.  real  o)  (4.16) 

where 

I It  (to)  II  = (T  *(to)  T (to))  (4.17) 

and  is  the  largest  eigenvalue  of  the  Hermetlan  matrix  Q and  T^*(to)  is  the 

conjugate  transpose  (or  adjoint)  of  the  matrix  T^Xto)  for  each  to.  The  singular 

values  of  a matrix  L are  the  square-roots  of  the  eigenvalues  of  the  Hermetian 
matrix  L*L  (see,  e.g.  [3]),  and  they  can  be  efficiently  numerically  calculated. 
Therefore,  (4. 16) -(4. 17)  can  be  written: 

I |T^(o))  I I = sv  (T^(to))  ^ for  all  real  to  (4.18) 
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where  sv(Q)  Indicates  the  largest  singular  value  of  Q.  This  makes  the  determin- 
2 

ation  of  L -BIBO  stability  a simple  matter  even  for  DPS;  so,  (4.18)  is  a fre- 
quency domain  condition  for  stability  of  (4.2). 

We  can  obtain  the  transfer  matrix  1q(w)  for  (4.2)  from 

T^(co)  = C R(jto,A^)  B (4.19) 

where 

R(X,A^)  = (4I-A^)"^  (4.20) 

is  the  resolvent  operator  for  A^  which  is  a bounded  linear  operator  on  H for  each 
A in  the  resolvent  set 


p(A^)  S {Xe  |R(A,A^)  is  bounded  operator  on  H}.  The  spectrum  of 

A^  is  given  by  '3’(A^)  = infinite-dimensional  spaces  it  can  be  quite 

complicated  since  A^  is  an  unbounded  operator,  in  general. 


Furthermore,  the  resolvent  operator  is  the  Laplace  transform  of  the 
C^-semigroup  U^(t)  generated  by  A^  dl7]  p 482): 


R(A,A^)  = 


U^(t)  e ^^dt,  ReA>  0 


(4.21) 


This  leads  to  a relationship  between  exponential  and  L -BIBO  stability: 


Theorem  2:  If  the  DPS  (4.2)  is  exponentially  stable,  then  jO)  is  in  P(A  ) and 

K ° 

||R(jO),A  )||  for  all  real  w (4.22) 

o 

where  the  constants  (K  ,0  ) are  obtained  from  (4.3)  and,  from  (4.22),  the  DPS 
2 o o 

is  L "BIBO  with  in  (4.9)  given  by 

M = I IcI  I 1 IbI I K /a  (4.23) 

O I I I 1 I I I I o o 


Proof 

Theo. 


Take  A = e+jto,  with  e>0,  and  A is  in  P(A^)  by  the  Hille-Yosida 
From  (4.21),  we  obtain: 


1 lR(e+  jw,A^)l 1 < 

o 


|u^(t)| 


-et  -jut 

e e 


dt 


< 


K 


o 


oo 

-(a  +e)t 
e ° dt  1 

o 


K 

o . 

e+a 


O 
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Now,  let  £-^0,  to  obtain  (4.22).  From  (4.19),  since  B & C are  bounded  linear 
operators,  we  have 

1 Mc||  Mb|1  ||R(ja),A^)|| 

1 llcll  MbM 

By  (4.16)  and  Theo.  1,  we  have  the  desired  result.  # 

See  also  the  inequality  (1.39)  in  [17]  p 485.  It  is  not  clear  under  what  con- 
ditions the  converse  of  Theo.  2 holds  for  DPS. 

2 

Later,  we  shall  need  to  consider  L -BIBO  stability  for  feedback  forms, 

l.e. , 

H(w)  = (I-Q(w))"^  (4.24) 

where  Q(to)  is  a bounded  linear  operator  for  each  real  w on  an  appropriate  Hilbert 

2 

space.  Stability  of  H(w)  in  the  L -BIBO  sense  translates  into  H(u)  causal  and 
bounded.  However,  we  shall  not  need  to  worry  about  causality  ([7]  or  [8])  because 
our  control  systems  will  be  designed  to  be  causal  in  the  time-domain;  hence,  our 
analysis  of  stability  will  only  need  to  verify  the  boundedness  of  H(o)) . Almost 
all  results  in  BIBO  stability  make  use  of  the  following  result  (sometimes  known 
as  the  Small  Gain  Theorem) : 

Theorem  3;  If  Q(u))  :H->H  is  a bounded  linear  operator  on  the  normed  space  H for 
each  real  w and 

I1Q(w)|1  < 1,  for  all  real  co,  (4.25) 

where 

I |q| I E inf  {M|  I |Qv| I ;<  M 1 |v| I for  all  v in  H}  (4.26) 

then  H(w)  E (I-Q(to))  ^ is  also  a bounded  linear  operator  on  H for  each  real  O) 
and 

I lH(w)| I < [1-1 lQ(w)| |]"^  (4.27) 

The  proof  of  this  result  can  be  obtained  from  [17]  p 30.  It  uses  the 
Neumann  series; 

OO 

(I+Q)"^  = I (4.28) 

k=0 


which  is  absolutely  convergent  when  (4.25)  is  satisfied;  (4.27)  follows  also  from 
(4.28)  when  Q(<jo)  Is  an  N X N matrix  (l.e.  dimH=N)  , then  (4.25)  becomes: 

sv  Q(w)  < 1,  for  all  real  w (4.28) 
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4.3 


Robustness  of  Stability  to  Regular  Perturbations 


Theo.  3 is  used  in  much  of  the  literature  to  study  the  effect  of  regular 
2 

perturbations  on  L -BIBO  stability.  Consider 


Q(oa)  = Q^(to)  + AQ(aO  (4.29) 

-1 

where  (I-Q^(t^))  is  known  to  be  bounded,  then  (I-Q(co))  is  also  bounded  from 
Theo.  3 when  the  parameter  uncertainty  AQ(w)  satisfies: 

l|AQ(a))|l  < [|l(I-Q^(a)))"^t  |]“^  (4.30a) 

or,  for  matrices, 

sv  (AQ(w))  < £v  (I-Q^(u)))  (4.30b) 

for  all  real  w,  where  ^(*)  denotes  the  smallest  singular  value  and  sv(*)  the 
largest  one.  The  Inequality  (4.30b)  follows  from  (4.30a),  the  definition  of 
singular  values,  and  the  property  [3]  that 


In  lumped  parameter  systems,  (4.30b)  is  particularly  easy  to  evaluate;  hence, 
the  popularity  of  this  approach  for  robust  control  design  under  parameter 
uncertainty.  However,  for  DPS,  the  effect  of  parameter  uncertainty  or  other 
2 

perturbations  on  L -BIBO  stability  can  be  assessed  with  (4.30a).  Loosely  stated 
(4.30)  Is  "gain  stabilization"  of  the  feedback  system. 

The  corresponding  situation  with  robustness  of  exponential  stability  is 
given  by  the  well-known  result : 

Theorem  4;  Let  generate  the  C^-semigroup  U^(t)  satisfying  the  exponential 
stability  property: 

-a  t 

1 lu  (t) 11  < K e ° , t > 0 (4.32) 

''o''—  o — 


where  1 and  b^>0.  Then  the  perturbed  operator: 

A = A + AA 
c o 


(4.33) 


where  AA  is  a bounded  operator,  generates  the  C^-semigroup  U^(t)  satisfying; 

^ -a  t 

llu^(t)ll£  K^e  , t > 0 (4.34) 


where 

f K = K 

MAAM 

*•  c o o ' ' ' ' 


(4.35a) 

(4.35b) 
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with  the  constants  given  in  (4.32).  The  perturbed  system  (4.33)  is 

exponentially  stable  if 

11  Ml  I < (4.36) 

The  proof  of  this  theorem  is  given  in  [12]  p 210;  it  is  a direct  consequence  of 
the  well-known  Gronwall  Inequality.  To  us,  Theo.  4 seems  to  be  one  of  the 
essential  tools  for  establishing  closed-loop  stability  of  DPS  with  finite- 
dimensional controllers,  e.g.  [2].  In  the  next  two  sections,  we  shall  use 
Theos.  3 and  4 to  establish  and  interrelate  some  closed-loop  stability  results 
for  DPS . 


5.0  CLOSED-LOOP  STABILITY  FOR  DPS:  TIME  DOMAIN  CONDITIONS 

In  Sec.  3.0,  model  reduction  is  used  to  obtain  a finite-dimensional 
controller  (3.5)  for  the  infinite-dimensional  DPS  (2.8).  In  this  section,  we 
consider  the  closed- loop  stability  of  the  DPS  with  the  controller  (2.8);  this 
stability  is  not  always  guaranteed  as  various  computer  simulations  and  labora- 
tory experiments  with  flexible  structures  have  demonstrated  [1].  We  want  to  use 

Theo.  4,  of  course,  but  the  main  difficulty  is  to  obtain  K and  O in  (4.32); 

o o 

our  stability  lemma  in  this  section  gives  some  reasonable  bounds  for  this  data. 

We  make  the  following  assumptions: 

(a)  the  ROM  (A^,B^,C^)  in  (3.3)  is  controllable  and  observable; 

(b)  the  estimator  gain  in  (3.5)  is  chosen  so  that  the  spectrum  of 

is  to  the  left  of  a vertical  line  through  o)  the  complex  plane; 

i.e.  Aj^-K^Cjj  generates  the  matrix  exponential  Uj^(t)  satisfying: 

- V ^ t > 0 (5.1) 

where  ^ > 1 and  > 0. 

(c)  the  residual  subsystem  satisfies  (2,4); 

(d)  the  subspaces  and  used  in  the  model  reduction  procedure  are 
stabilizing  subspaces  for  (A,B),  as  described  in  Sec.  4.0;  hence,  A^ 
generates  the  C^-semlgroup  U^(t)  satisfying  (4.3); 

(e)  choice  of  in  (3.5); 

(f)  persistent  disturbances. 

The  controller  (3.5)  can  be  designed  by  standard  finite-dimensional  techniques. 

Let  e =v  -v„  and  obtain,  from  (2.8)  and  (3.5): 

N N N 
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(5.2a) 


f(t)=Gv^(t)  = Gv^(t)+G^e^(t) 

3e  (t) 

(5 -2b) 


Note  that  we  may  omit  P from  (5.2b)  since  it  is  already  incorporated  into  C„ 

K R 

and  The  closed-loop  system  (2.8)  and  (3.5)  can  be  written  (from  (5.2)): 


9v(t) 

9t 


= A v(t)  + 
o 


B 


■ <Vr-V  ''<'=)+(W»> 


(5.3) 


We  have  the  following  closed-loop  stability  result: 


Theorem  5 : If  assumptions  (a)-(f)  are  satisfied,  then  the  gain  may  be 

chosen  so  that  the  finite-dimensional  controller  (3.5)  produces  an  exponentially 
stable  closed-loop  system  (5.3)  when  j I sufficiently  small. 


The  proof  of  Theo.  5 is  obtained  directly  by  taking  A^^=Aq, 

A2^=Kj^Cj^-A^j^,,  and  A22=A^“K^C^  in  the  following  stability  lemma  whose  proof  uses 
Theo.  4 and  appears  in  [2]: 


Lemma  1: 


Consider 

9oo  ~ 

■\l 

^12 

(5.4) 

v—  = A 0)  = 

9t  c 

A 

A 

(A) 

L 21 

22J 

where  A. , 
XI 

are  bounded  for  i 

^ j 

and  A^^  generates 

the  C^-semigroup  U^(t)  with 

the  growth  property: 

- 

-a.t 

X 

u.(t)  < 

K.e 

X 

, t 

• > 0 

(5.5) 

for  i=l,2.  Assume  ^ a^.  Then  A^  generates  the  C^-semigroup  Uj,(t)  with 
growth  property:  t 


|u^(t)||  IV  » t > 0 


(5.6) 


where,  for  exponential  stability, 


with 


(5.7) 
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= min 


Kc  = < K^K^d-H]^) 


(5.8a) 

(5.8b) 


and 


A. 


^ = 


12' 


|V^2| 


where  | |o)|  [ = (||a)j^|]^+  | |o)2 1 | 


The  dual  result  with  A^2  ^21  i^^terchanged  in  (5.7)  is  true,  also.  The  in- 

equality in  (5.8b)  follows  from  (1+a+a^)  < (1+a)^  when  a > 0. 


Due  to  assumption  (e)  and  (5.8a),  we  have 


a = a 

o o 


and  the  exponential  stability  bound  of  Theo.  5 becomes: 

HVe-VN  ‘ 


(5.9) 


(5.10) 


with  given  by  (5.8b)  and 


4>  = I I • <V®o> 


K.K.  - 


-1 


'1^'2 


(5.11) 


where  given  in  (5.1)  and  (4.3),  respectively.  These 

constants  can  either  be  obtained  from  the  finite-dimensional  controller  design 
or  from  the  stabilizing  subspace  calculations  of  Sec.  4.0.  The  stability  test 
(5.10)  involves  the  norm 


ivr-vhi  11*1.11  iicji  +iivi 


(5.11) 


Hence,  with  only  a knowledge  of  the  norms  of  and  A^,  we  can  assure  closed- 
loop  exponential  stability  when: 


I'S.II  lIsM  •*  1 1" 


(5.12) 


Note  that  K involves  the  norm 
c 


iBGjjM  1 MB| 


(5.13) 
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6.0  CLOSED- LOOP  STABILITY  FOR  DPS:  FREQUENCY  DOMAIN  CONDITIONS 

In  Sec.  5.0,  the  Internal  stability  of  the  closed-loop  system  was  guaranteed 
by  a test  of  the  form  (5.12)  involving  the  norms  of  the  residual  data.  In  this 
section,  we  consider  the  closed-loop  stability  from  an  external  standpoint  and 
use  Theo.  3.  We  assume  here  (a) -(e)  of  Sec.  5.0  and  also  (g)  v^  = 0 in  (2.8), 

i.e.  no  Impulsive  disturbances.  The  finite-dimensional  controller  (3.5)  remains 
unchanged . 


The  closed-loop  system  consisting  of  (2.8)  and  (3.5)  may  be  written: 


9t 

y(t)  = C v(t) 


<5.S-V>  - 


The  initial  conditions  are  (g)  and 


eN(o)  = ^^(o)  - Vj^(o)  = -P^v^=0 


(6.1a) 

(6.1b) 

(6.1c) 

(6.2) 


Note  that  the  term  B^^f^  appears  in  (6.1b)  because  the  controller  does  not  know 

the  persistent  disturbance;  this  seems  to  be  the  most  realistic  (and  the  most 
difficult)  situation  to  analyze.  From  (6.1a)  and  (g),  by  taking  Fourier  trans- 


forms we  have: 

v(w)  = R(jw,A^)  B[G^g^(o))  + fjj(w)]  (6.3a) 

y(w)  = C v(w)  (6.3b) 

where  R(jco,A^)  = (jw-A^)  Also,  (6.1b)  and  (6.2)  yields: 

d^(u))  = R^O.)  ^(a»  - B^fj^(o))]  (6.4) 

where 

\(“)  = 

R(^,A^-K^Cj^)  = (6.5b) 

\r  “ Vr“\r  (6.5c) 


This  is  Illustrated  in  Fig.  1 which  suggests  defining 

E(0J)  = d^(u)  + fj^(w)  (6.6) 

From  (6.3)  and  (6.6),  we  have 
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where 


■ (6.7a) 
(6.7b) 


v(w)  = R(jU),A^)  B E(to) 
y(o))  = T^(u))  l(w) 


(6.8) 

(6.9) 


L(w)  = R^(03)  R(jW,A^)  B 
K 

and  the  equivalent  disturbance  fp('^)  is 

Therefore,  from  (6.9), 

E(o))  = S(w)  fp(w) 

where 

S(0))  E (I^-G^L(O)))"^  (6.13) 

which  Is  the  Inverse  of  the  return  difference  operator  I^-Gj^L(oi)) . Furthermore, 
from  (6.7b)  and  (6.12),  we  have  the  closed- loop  Input-output  relationship; 

y(0))  = T (to)  f^(to)  (6.14) 

c D 

where  the  P X M closed-loop  transfer  matrix  Is 

Tc(‘^>  = T^(co)  S((o)  (6.15) 

whenever  S(to)  exists. 

Note  that,  even  In  the  most  Ideal  case  when  (6.14)  becomes: 

5^^ (to)  = T^(to)  ^^(^o)  (6.16) 

^E 

with  Tq(w)  given  by  (6.8)  and  fjj(<J^)  given  by  (6.11).  This  is  not  surprising 
because  there  is  still  a term  due  to  the  unknown  disturbance  fjj(t)  entering  the 


(6.10) 


(6.11) 


(6.12) 


T^(to)  E C R(jto,A^)  B 

Also,  from  (6.4)  and  (6.7a)  in  (6.6),  we  obtain: 

g(to)  = $^(to)  + G L((0)  E(t0) 

U N 

where 
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state-estimator  error  equation  (6.1b).  If  this  disturbance  were  completely  known, 
then  a counteracting  term  could  he  added  to  the  controller  (3.5), and  (6.11)  would 

/S'P^  /N 

reduce  to  • However,  in  most  practical  problems,  the  disturbance 

would  not  be  completely  known;  hence,  we  must  have  (6.11).  It  is  clear  that,  if 

2 g 

fj^(t)  is  an  L -bounded  disturbance,  then  so  is  the  equivalent  disturbance  f^(t); 


this  can  be  obtained  from  (4.11)  and  the  fact  that  (using  Theo.  2); 
I 1Hjj(w)  1 I _<  3.11  real  w 


(6.17) 


where  (K^,a^)  are  given  in  (5.1). 


This  leads  to  our  main  result  for  DPS  stability  In  the  frequency  domain; 


Theorem  6 : If  the  assumptions  (a)-(e)  in  Sec.  5.0  and  (g)  in  Sec.  6.0  are 

satisfied  and  if 


1 1 Vr-vi  I 


< 


N o 


(6.18) 


where  (K^,aj^)  are  given  in  (5.1)  and  (K^,0^)  are  given  in  (4.3),  then  the  finite- 
dimensional controller  (3.5)  produces  an  L^-BIBO  stable  closed-loop  system 
(6.1). 

Proof : From  the  discussion  preceding  the  statement  of  this  theorem,  we  need 

only  show  that  the  operator  S(oj)  in  (6.13)  exists  and  is  bounded  uniformly  for 
all  real  OJ.  Then,  by  Theo.  1 and  the.  condition  (4.16),  the  desired  result  will 
follow. 


However,  by  Theo.  3,  we  need  only  show  that 
I |g^L(w)  I I < 1 for  all  real  o) 


(6.19) 


in  order  that  S(o))  exists  and  is  bounded.  But,  from  Theo.  2,  we  have 

K 

o 


(6.20) 


because  generates  an  exponentially  stable  semigroup  due  to  assumptions  (c)  and 

(d).  .A.lso,  from  assumptions  (a)  and  (b),  (6.17)  is  satisfied.  Note  that  the 
spectra  of  and  are  in  the  open  left-half  of  the  complex  plane;  hence 

jo)  is  in  the  resolvent  set  of  both  operators.  Therefore, 


K 

o 

a 

o 


1b|  1 < 1 


by  (6.18)  and 


the  desired  result  holds.  # 
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Of  course,  when  (5.10)  holds.,  the  closed-loop  system  is  exponentially 

2 

stable  by  Theo.  5,  and,  consequently  by  Theo.  2,  it  is  also  L -BIBO  stable. 
However,  we  would  like  to  compare  (5.10)  with  (6.20)  to  see  if  the  condition 
for  Theo.  6 is  easier  to  satisfy  than  that  for  Theo.  5.  The  answer  is:  yes. 


if  we  bound  in  (5.10)  by 

\ 5 Vo 


N o 


(6.21) 


which  is  obtained  from  (5.8b),  (5.11),  and  (5.13). 
when 


IIVr-*hrI 


a 

o 


+ 


N' 


N o 


Then  (5.10)  is  satisfied 


(6.22) 


Now,  the  right-hand  side  of  (6.22)  is  less  than  or  equal  to  the  right-hand  side 
of  (6,18),  i.e.  from  assumption  (&)  we  have: 


where  a = 


a,  o~a  o-Q 

N ^ N o ^ N 0 


a — a — a+(o  -a  ) , . a 

N o 1+  — 


|B  I and,  therefore. 


O'  o„ 
o N 


a 


Vo“ 

N o 


(6.23) 


(6.24) 


This  says  that,  when  (6.21)  is  used  in  (5.10),  the  resulting  condition  (6.22) 
for  exponential  stability  is  more  stringent  than  the  condition  (6.20)  for 
2 

L -BIBO  stability  of  the  closed-loop  DPS. 


7 . 0 CONCLUSIONS 

Our  main  results  Theos.  5 and  6,  presented  in  Sec.  5.0  and  6.0,  give  tests 
for  exponential  stability  (time-domain)  and  L -BIBO  stability  (frequency  doiTiain) 
of  closed-loop  DPS  with  finite-dimensional  controllers.  For  the  time-domain  test, 
an  internal  view  of  the  system  is  taken  and  exponential  decay  of  the  effects  of 
any  impulsive,  disturbance  on  the  full  infinite-dimensional  state  is  required. 
However,  for  the  frequency  domain  test^  an  external  view  is  taken  and  bounded 
input-bounded  output  behavior  in  the  L -sense  is  expected  for  any  square-inte- 
grable  persistent  disturbance.  The  time-domain  result  is  a consequence  of 
regular  perturbation  of  the  C^-semigroup  generated  by  the  system.  The  frequency 

domain  result  is  obtained  via  the  Fourier-Plancherel  (or  Parseval)  Theorem  for 
Fourier  transforms  and  a version  of  the  Small  Gain  Theorem;  in  this  case,  the 
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system  resolvent  operator,  which  is  the  Laplace  transform  of  the  semigroup, 
becomes  the  important  concept. 

2 

It  was  expected  that  L -BIBO  stability  would  yield  a less  stringent  test 
since  it  is  weaker  for  DPS  than  exponential  stability  (see  Theo.  2).  This  was 
shown  to  be  true  for  some  choices  of  bounds  on  the  data  used  for  Theo.  5;  however, 
it  may  not  always  be  so.  It  should  be  noted  that  both  approaches  are  regular 
perturbation  methods  where  a nominally  stable  system  (e.g.,  the  closed-loop  DPS 
with  and  is  perturbed  by  a bounded  term;  bounds  on  the  size  of  this 

term,  for  which  stability  is  maintained,  produce  the  closed-loop  stability  tests. 

The  stability  results  presented  here  for  the  time  and  frequency  domain  are 
related  by: 

(a)  the  C^-semigroup  of  the  system  and  its  Laplace  transform,  the  resolvent 
operator; 

(b)  regular  perturbation  theory; 

(c)  the  concept  of  stabilizing  subspaces  for  DPS. 

It  seems  to  us  that,  without  the  property  (c),  a DPS  ^ould  have  little  or  no 
chance  of  being  stabilized  (either  exponentially  or  L -BIBO)  by  a finite- 
dimensional  controller. 

Finally,  we  note  that  the  controller  design  was  not  considered  the  focus 
of  this  paper.  The  designs  can  all  be  based  on  finite-dimensional  reduced-order 
models;  hence,  many  time  or  frequency  domain  techniques  are  available.  Further- 
more, to  be  implementable  with  a digital  computer,  the  controller  (3.5)  should 
be  discrete  (rather  than  continuous)  time.  However,  the  main  conclusions,  with 
which  we  hope  the  reader  will  agree,  are  that  closed-loop  stability  analysis 
should  be  an  Intrinsic  part  of  any  attempt  to  produce  finite-dimensional, 
practical  controllers  for  infinite-dimensional  DPS,  and  that  such  analysis  can 
be  carried  out  with  perturbation  techniques  in  many  disguises. 
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ABSTRACT 

This  paper  considers  the  approximation  of  the  optimal  compensator  for  a 
Large  Space  Structure.  The  compensator  is  based  upon  a solution  to  the  Linear 
Stochastic  Quadratic  Regulator  problem.  Colocation  of  sensors  and  actuators  is 
assumed.  A small  gain  analytical  solution  for  the  optimal  compensator  is  obtained 
for  a single  input/single  output  system,  i.e.,  certain  terms  in  the  compensator 
can  be  neglected  for  sufficiently  small  gain.  The  compensator  is  calculated  in 
terms  of  the  kernel  to  a Volterra  Integral  operator  using  a Neumann  series.  The 
calculation  of  the  compensator  is  based  upon  the  Cq  semigroup  for  the  infinite 
dimensional  system.  A finite  dimensional  approximation  of  the  compensator  is, 
therefore,  obtained  through  analysis  of  the  Infinite  dimensional  compensator 
which  is  a compact  operator. 


1.0  INTRODUCTION 

One  of  the  distinguishing  properties  of  a Large  Space  Structure  (LSS)  is 
that  it  is  a distributed  parameter  system  and,  hence,  an  infinite  dimensional 
mathematical  model  is  required  for  its  description.  In  most  applications, 
active  control  of  shape,  attitude,  and  structural  vibrations  will  be  necessary. 

For  such  problems,  formulation  of  the  control  problem  as  a steady-state  (infinite 
time)  linear  quadratic  regulator  is  natural.  The  main  advantage  for  considering 
the  infinite  time  case  is,  of  course,  that  the  optimal  control  gain  is  time 
insariant.  Implementation  of  the  optimal  feedback  control  will  generally  require 
an  estimate  of  the  system  state.  For  the  stochastic  problem,  the  optimal  state 
estimate  is  provided  by  an  infinite  dimensional  Kalman  filter.  However,  it  is 
at  this  point  where  a significant  gap  exists  between  theory  and  practice  in  that 
implementation  of  an  infinite  dimensional  filter  is  not  generally  possible. 

The  most  popular  solution  to  this  dilemma  at  the  present  time  seems  to  be 
reduced  order  modeling  of  the  system,  see  for  example  [1],[2],[3],  and  [4]. 
Usually,  a modal  representation  of  the  system  is  assumed.  The  basic  idea  is  then 
to  evaluate  various  computable  criteria  representing  the  significance  of  various 
modes  and  then  select  some  finite  subset  of  modes  to  represent  the  dominant 
dynamics  of  the  system.  A linear  quadratic  regulator  (controller /estimator  com- 
bination) for  this  finite  dimensional  model  is  then  designed  using  standard 
methods.  The  stability  of  the  system  using  the  resulting  compensator  is  usually 
checked  by  computing  the  eigenvalues  of  the  closed  loop  system  using  a high  order 
"truth  model"  to  represent  the  LSS  dynamics,  where  the  truth  model  is  of  finite 
order,  but  of  much  higher  order  than  the  compensator. 

* Research  supported  in  part  under  Grant  No.  78-3550,  AFOSR,  USAF,  Applied  Math 
Division. 

**  Paper  has  been  revised  (8/6/82)  from  version  distributed  at  workshop  (7/15/82) . 
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There  are  two  basic  problems  with  the  above  approach.  The  first  is  that  the 
approximation  is  performed  upon  the  system  rather  than  the  compensator.  In 
effect,  the  truncation  of  an  infinite  number  of  modes  is  tantamount  to  "approxi- 
mating" an  unbounded,  infinite  dimensional  operator  by  one  that  is  bounded  and 
finite  dimensional.  Thus,  the  infinite  dimensionality  of  the  problem  is  never 
used  in  designing  the  compensator.  The  second  problem  is  that  the  stability  of 
the  closed  loop  system  is  also  checked  against  a finite  order  model  and  the 
closed  loop  stability  of  the  actual  infinite  dimensional  system  is  inferred 
essentially  on  faith. 

To  avoid  these  pitfalls  essentially  requires  that  we  have  a partial  differ- 
ential equation  (PDE)  to  represent  the  system.  For  complicated  structures  this 
may  be  asking  too  much.  Hence,  the  method  of  reduced  order  modeling  may  be  as 
viable  an  approach  as  any  in  those  cases.  However,  we  can,  and  should,  study  the 
problem  of  approximating  the  optimal  compensator  for  appropriate  systems  where 
the  PDE  is  known,  since  we  may  gain  insight  into  the  approximation  problem  that 
would  apply  to  more  general  and  more  complicated  systems. 

This  paper  considers  the  compensator  design  and  approximation  for  a class  of 
systems  representative  of  large  space  structures.  An  analytical  solution  for  the 
compensator  is  obtained  using  the  Infinite  dimensional  model  of  the  system.  A 
finite  dimensional  approximation  of  the  compensator  is  then  derived  based  upon 
analysis  of  the  Infinite  dimensional  compensator. 


2.0  SYSTEM  MODEL 

We  will  consider  the  following  system 

Mw(t)  + A w(t)  = bu(t)  + bn,(t)  (2.1) 

o d 

y(t)  = |b,w(t)]  + e(t)  (2.2) 

(2.1)  is  the  Inhomogeneous  equation  of  motion  of  an  undamped  oscillator  and 

(2.2)  is  the  system  measurement  equation.  w(t)  is  an  element  of  a separable 
Hilbert  space  H and  represents  the  small  displacements  of  the  system  (transla- 
tion and  rotational)  relative  to  its  equilibrium  position.  The  operator  M con- 
tains the  mass  and  inertia  properties  of  the  system.  M maps  H to  H,  is  linear 
bounded,  self-adjoint,  and  positive  definite.  , represents  the  stiffness  of  ' 
the  structure  and  maps  V(A^) , the  domain  of  A^  which  is  dense  in  H,  to  H.  Aq  is 
linear,  self-adjoint,  closed  (or  can  be  closed)  and  generally  unbounded.  We 
assume  there  exists  e>0  such  that 

II  ii2 

[A  w,w]  > e w , w e V(A  ) 

The  spatial  domain  of  A^,  , is  bounded  and,  thus,  the  resolvent  of  A^  is  com- 
pact for  each  X in  the  resolvent  set  of  Aq.  Since  Aq  is  closed  and  has  a com- 
pact resolvent,  its  eigenvalues  are  isolated  (countable) , have  finite  multipli- 
cities, and  have  infinity  as  the  only  limit  point  ([5],  p.l87).  In  addition, 
the  modes  {<f>k,  k=l,2...}  of  Aq  are  orthogonal  and  form  a basis  in  H. 

A single  white  noise  disturbance  n^(t)  is  present  with  a spatial  force  dis- 
tribution defined  by  b e H.  The  support  of  b is  assumed  to  be  small  compared  to 
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the  measure  of  so  that  bnjj(t)  represents  a physical  realization  of  a point 
disturbance.  A single  control  u(t)  and  rate  sensor  y(t)  are  colocated  at  the 
sight  of  the  disturbance.  The  measurement  error  e(t)  is  a white  noise  process 
uncorrelated  with  n^jCt)  . u(t)  , nd(t)  , y(t),  and  e(t)  are  all  elements  of  R for 
each  t e [0,«>)  . Note  also  that  the  control,  measurement,  and  disturbance  are 
all  compact,  since  they  are  each  finite  in  number. 

In  order  to  utilize  semigroup  theory,  (2.1)  and  (2.2)  will  be  put  in  first 
order  form.  Let  Xj^(t)  = w(t)  and  X2(t)  = w(t)  then,  the  system  (2.1)  and  (2.2) 
can  he  written  as 


x(t)  = Ax(t)  + Bu(t)  + Fn^(t) 


(2.3) 


y(t)  = Cx(t)  + e(t) 


(2.4) 


where 


0 

i' 

x(t)  = 

1 

rt 

i 

A = 

-M"^A 

1-  O 

0 

B = F 


0 

M”^b 


C = [0,  (M  ^b)*] 


Note  that  0(A)  = ViAp)  x (/,  where  0 = {w  |A^  w e H} . Note  also  that  A will  have 
a compact  resolvent  since  A^^  does. 


As  a function  space,  it  is  natural  to  use  the  energy  space  E = 1/  x H defined 
by  the  inner  product 


[x,z]^ 


[A"^  X , A'^  z ] + [Mx  , 
o i o i z 


z^l  j x,z  e E 


(2.5) 


It  is  easy  to  verify  that  A = -A*  under  the  inner  product  (2.5).  Hence, 
by  [6,  Corollary  4.3.1],  A generates  a norm  preserving,  strongly  continuous 
group,  T(t). 

The  semigroup  can  be  represented  using  the  modes  of  M ^A^.  The  modes  c|)j^ 
are  orthogonal  under  the  inner  product  [M*,*]  and  are  complete  in  H.  Using  the 
(j)k's,  a set  of  basis  vectors  in  E can  be  defined  by  (])2k  = 

k=l,2,...  The  set  k=l,2,...}  is  complete  in  E and  orthogonal  under 

Thus,  any  x e E can  be  written  as 


^ = ,2: . ^ik  ^k  + ^2k  ^2k 

k=l 

where  [c|)^^,x]£  = x^^^,  [<(>2]^’^]e  ^2k* 

A representation  for  the  semigroup,  can  be  obtained  from  the  homogeneous 
solution  to  (2.3).  Writing  (2.3)  as 


X 


Ax  = X] 


k=l 


^^Ik  ^2k^  '*’lk  *'^2k  "k  ^Ik^  '**2k  ° 
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we  see  that  Xi,  and  x„,  solve 


1 

X • 

1 

■ 0 

r 

1 

r— 1 

. 

— 

2 

0 

L''2k. 

L-“k 

_"2kJ 

for  each  k.  Thus  the  semigroup  can  be  expressed  as 
x(t)  = T(t)  x(0) 


E 

k=l 


''2k 

X,,  (0)  cosm,  t H sino),  t 

Ik  k 0),  k 


Ik 


(2.6) 


+ [-m^  x^^(O)  slnw^t  + ^2k^^^  cosm^t]  (j)2^ 


3.0  STOCHASTIC  CONTROL  PROBLEM 

Given  the  system  (2.3)  and  (2.4),  we  want  to  find  a control  u(t)  to  minimize 


[u]  = lim  [yMB*x(t)  + |u(t)  1^]  dtj 


(3.1) 


where  y>0  and  E{*}  denotes  expected  value.  Let  the  noise  covariances  be  given 
by  E n(j(t)  n^(t)  = o|  I and  E e(t)  e*(t)  = a|.  For  (A,B)  controllable*,  the 
minimum  of  (3.1)  is  attained  for 


u (t)  = -B*  P x(t) 

o c 


where  x(t)  satisfies 


(3.2) 


x(t)  = Ax(t)  + Bu^(t)  + P^B*  [y(t)  + Bx(t)] 
and  P^  and  P^  satisfy,  respectively. 


[P^x,Ax]  + [Ax,P^x]  + y [B*x,  B*x] 


x(0)  = 0 


(3.3) 


- [B*  P X,  B*  P x]  = 0, 

c c 


X e P(A) 


(3.4) 


and 


[P^x,A*x]  + [A*x,P^x] 


[B*x,B*x] 


. (3.5) 

- — [B*  P^x,  B*  P^x]  =0,  X e t>(A*) 

a 

s 

* Approximately  controllable,  see  [6,  Theorem  4.9.2]. 
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It  is  easily  verified  by  direct  substitution  that  under  the  energy  inner  product 

P = yl  and  P.  = — I 

C ‘ ’’i 

thus  (3.2)  and  (3.3)  become 

u^(t)  = - yb*  x(t)  (3.6) 

A . cj  . . 

x(t)  = Ax(t)  + Bu  (t)  + — B[y(t)  -B*x(t)],  x(0)  =0  (3.7) 

o 

The  corresponding  minimal  cost  is 

Remark  3.1.  The  above  solution  requires  that  the  steady-state  Riccati  equa- 
tions (3.4)  and  (3.5)  have  unique  positive  definite  solutions  and  that  x(t) 
be  asymptotically  stationary.  Regarding  (3.4)  and  (3.5),  since  we  have  ana- 
lytical solutions,  the  only  concern  is  uniqueness  and  this  is  guaranteed  by 
(A,B)  controllability  [8].  The  asymptotic  stationarity  of  x(t)  follows  easily 
through  application  of  [6,  Theorem  6.7.1]. 

Eemark  3.2.  For  more  general  problems,  where  for  example  we  replace  B in  (3.1) 
with  another  operator  R,  which  is  Hilbert-Schmidt,  exponential  stabilizability 
of  the  system  is  sufficient  to  guarantee  solutions  to  the  infinite  time,  sto- 
chastic regulator  (ITSR)  problem  [6,  Theorem  6.9.1].  However,  when  compact 
controls  are  employed,  as  is  the  case  here,  the  system  cannot  be  given  a uni- 
form exponential  decay  rate*  unless  the  open  loop  system  is  already  uniformly 
exponentially  stable.  Note  that,  here,  the  open  loop  system  (2.3)  is  unitary, 
i.e. , ||T(t)|l  = 1.  Hence,  the  closed  loop  system  can  only  be  strongly  stable**. 

Remark.  3.2.  The  existence  of  solutions  to  the  ITSR  problem  for  systems  that  are 
only  strongly  stabilizable  is  largely  an  open  problem.  Recent  results  concern- 
ing strong  stability  and  the  steady-state  Riccati  equation  are  given  in  [8],  but 
the  sufficient  conditions  for  existence  are  quite  strong,  requiring  in  particu- 
lar ||r*x||  _<  ||b*x||  for  the  control  problem  and  ||f*x||  j<  M2||Cx||  for  the  filter- 

ing problem,  where  M]^  and  M2  are  constants’^**  One  possible  physical  interpretation 
of  these  requirements  is  that  we  only  attach  a performance  penalty  to  those 
points  where  we  have  located  actuators  and  that  we  locate  a sensor  at  each  dis- 
turbance source. 


* A semigroup  T(t)  is  uniformly  exponentially  stable  if  3 o)>0,  M>1  3 

|fr(t)||  £ Me-iit,  t^O. 

**  A semigroup  is  strongly  stable  if  ||T(t)x||  ->■  0 as  t-x=°,  xeE. 

***R  is  the  state  weighting  operator,  F is  the  disturbance  input  operator,  and  C 
is  the  observation  operator. 
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4.0  OPTIMAL  COMPENSATOR 


Let  T(t)  be  the  semigroup  generated  by  A.  Then,  the  solution  to  (3.7) 
given  by 


IS 


x(t)  = J T(t-t)  IbuCt)  +^B[y(x)  - B*  dx 

old  ) 


(4.1) 


Using  (3.6),  we  can  rewrite  (4.1)  as 

t 


u(t)  = 


-i 


a 


B*  T(t-x)  B jYu(x)  +—  [yy(T)  - B*  x(x)]^  dx 

^d 


J B*  T(t-x)  B j^Y+  u 


yo 

(x)  + - — y(x)[  dx 


Therefore 


L L 

u(t)  + f T(t-x)  Bu(x)  dx  = - k f B*  T(t-x)  By(x)  dx  (4.2) 

^ **0  ^ ‘’o 

where  = y + Og/o^,  k^  = ya^/a^.  Observe  that  (4.2)  is  a Volterra  equation. 
Define  the  Volterra  operator  I as 

t 

Lf  = h ; ^ B*  T(t-x)  Bf(x)  dx  = h(t) 

*^o 

Then  (4.2)  can  be  written  as 


u + k Lu  = - k Ly 

u y 

Solving  for  u we  obtain  the  optimal  compensator  in  abstract  form 

u = - k [1  + k 1]"^  Ly 

c y u ^ 

= - k [1  4-  K]  ly 

y 

where  K is  also  a Volterra  operator  of  the  form 

t 


(4.3) 


(4.4) 


Kf  = h ; J K(t,x)  f(x)  dx  = h 


From  [6,  p. 102-103],  K(t,x)  can  be  computed  iteratively  and  is  given  by 

00 

K(t,x)  = (-l)’^^^  g„(t,x) 


n=l 


(4.5) 


where 


g (t,x)  = g (t-x)  = f g(t-o)  g . (a-x) 

u n V II"”  I 


da 


(4.6) 
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g(t-x) 


B*  T(t-x)  B 


(A. 7) 


Noting  that 

^ g^(t-t)  f(x)  dT 

o 

(4.4)  can  also  be  written  as 

u = - k |l-k  +k^i.^-k^L^+...lLy 

c y ( u u u ) 

!00  \ 

13  ^ j y (A. 8) 

n=l  ) 

Since  B*  T(t-T)  B is  uniformly  continuous  and  (4.3)  is  a Volterra  equation,  the 
iteration  defined  by  (4.5)-(4.7)  and,  hence,  the  series  (4.8)  converge  for 
te[0,T'l  ,Tj<t,T<  [6].  The  series  (4.8)  is  sometimes  called  a Neumann  series, 
[9].  Thus,  the  optimal  compensator  is  of  the  form 

ft 

u(t)  = - k j g (t-r)  y(x)  dx  (4.9) 

y J ^ o 

where  the  kernal  g^(t-x)  is  given  by  the  bracketed  term  in  (4.8).  Note  that 
(4.8)  utilizes  the  infinite  dimension  system  model  via  (4.7)  which  is  given  in 
terms  of  the  open  loop  semigroup  operator,  T(t),  and  we  have  a representation 
for  T(t) , (2.6),  using  the  modes  of  In  particular,  expanding  (4.7)  on  the 

orthonormal  basis  for  E,  as  defined  in  Section  2,  gives 

00 

g(t-x)  = B*  T(t-x)  B = ^ b^  coso)j^(t-x) 

k=l 

where  b^  = [M(M"^b),  = [8,  c|)j^]. 

Properties  of  g(o) 

i)  g(a)  ->  0 as  cr  ->  0° 
ii)  g(n)  is  uniformly  continuous 
iii)  g(a)  is  compact  for  each  t ^ 0. 

Condition  i)  follows  since  g(o)  = B*  S(o)  B and  S(a)  is  the  Cq  semigroup 
generated  by  A - k^BB*  which  is  strongly  stable.  Note  that  S(a)  is  also  a con- 
traction, i.e.,  ||s(a)||  < 1.  The  second  condition  results  from  the  strong  con- 
tinuity of  S(o).  For  condition  iii),  g(cr)  is  compact  since  B is  compact  and 
S(a)  is  a bounded  operator. 
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5.0  SMALL  GAIN  APPROXIMATION  TO  OPTIMAL  COMPENSATOR 

Using  the  procedure  outlined  in  Section  4.0,  four  complete  Iterations  and 
a partial  fifth  iteration  were  performed.  This  yielded  an  approximation  for  the 
kernal  of  the  optimal  filter  given  by 


go(cT)  = 


= V 


/ . costjj,  a 
k=l  ^ 


sinto  CT 

£ ^Ck  \ 


OO  00 


- E E 

k=l  j=l 
k?^j  ' 


^k  r 


kj  01. 


k 


1 


‘ - Lk"  + l<«ck”>' 


+ k^  0,  (k  ; — ; a) 
u 4 u w, 
k 


1,2  . 

b.  sinoi.a 


+ 


jk  01 . 
1 


(5.1) 


where 

^ck  = ^u  ^k/2  > = \ - “J)  > a = t - T 

2 1 

The  remainder  term  k^  O^Cky  ; (jj—  ; a)  is  given  in  the  appendix  and  is  composed 
of  terms  of  second  order  or  greater  in  k^. 


It  is  apparent  that  the  terms  in  the  square  brackets  in  (5.1)  are  the  first 
few  terms  of  the  Taylor  series  expansion  for  exp  ("Sckt) . Thus,  the  limiting 
form  of  the  kernel  is 


go (a)  = i^(a)  + k^  0(k^;  ^ ; a) 

k 


where 


00  00 


So*")  ■ S \ 

k=l  k=l  j = l 


2 "^ck'^  r slnoi  a 

glk(a)  = ,b^  e — 


82kj^'^)  = \j  \ ^ 


2 -^ck' 


\\  / j V “j  / 


(5.2) 


(5.3) 


(5.5) 


Note  that  the  component  filters  gn^(cr)  have  second  order  dynamics  while  the  fil- 
ters g2kj (®)  have  fourth  order  dynamics.  Also,  all  the  parameters  in  gik(o)  are 


458 


related  to  the  kth  open  loop  mode,  while  82k related  to  the  cross-coupling 
between  the  kth  and  jth  modes.  Due  to  the  presence  of  the  factors  1/uk 
in  g2i^j (0) , the  cross-coupling  terms  are  attenuated  relative  to  the  "diagonal" 
terms  gik(t-T)  as  wk  and  Uj  become  large.  Thus,  at  high  frequencies  the  dominant 
filter  dynamics  are  due  to  the  diagonal  terms  gj^k^^^  • Since  the  iteration  con- 
verges and  since  go(o)  and  gQ(o)  are  continuous  in  a,  k^  0(k^;  l/o)k;<?)  is 
continuous  in  a.  Also  k^|  0(ku;  l/aik;^)  | ->  0 as  ku  0 for  each  oe[0,T].  Therefore, 
for  any  e >0,  we  can  find  k^  sufficiently  small  such  that 


max  |k 
0<t<T  ^ 


0(k 


< e 


The  argument  l/m|^  is  used  to  indicate  that  1 0(k^;  1 /wk*'?)  1 0 as  tOk  ->■  «>.  Therefore, 

assuming  k^  is  small,  the  approximate  optimal  control  is  given  by 


A ^ 

u(t)  = -k  1 .g  (t-t)  y(T)  dr 

y V O 

o 


00  t 


-k 


Qo  00  t 

k=i  •’o  y k=i  j=i  •'o 


k?^j 


Since  ky  and  k^  are  both  small,  as  crude  first  approximation,  the  double  sum- 
mation in  (5.6)  could  be  neglected.  In  this  case,  the  filter  takes  a particularly 
simple  form,  namely, 


i(t)  = X) 


(5.7) 


k=l 


where 


^ r 

u^(t)  = -k  ^ g^^(t-T)  y(r)  dt 

y o 

Here,  the  basic  structure  of  the  compensator  is  an  infinite  bank  of  second  order 
filters,  each  operating  on  the  sensor  output  and  the  control  is  simply  the  sum 
of  their  outputs.  The  component  filters  gjj^(t-T)  in  this  case  have  a rather 
special  property.  Letting  Gj^k(s)  denote  the  Laplace  Transform  of  gj^^(t-T)  , we 
have 


Gik(s) 


hh 

k 

2 

s + 25 


ck^  + + “k 


Such  a filter  is  strictly  positive  real,  [11],  [12].  Note  that  the  phase  shift 
of  the  filter  never  exceeds  ±v/2.  According  to  [13],  strictly  positive  real  fil- 
ters have  robust  stability  properties  in  the  sense  that  they  will  stabilize  any 
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positive  real  system  (In  an  input/output  sense).  The  system  considered 
here  is  positive  real  since  it  is  an  undamped  oscillator.  Note  that  any  finite 
sum  of  positive  real  filters  is  also  positive  real.  Therefore,  we  can  truncate 
the  summation  (5.7)  at  any  finite  number  of  terms  and  the  compensator  will  still 
stabilize  the  system.  It  would  be  expected  that  the  more  terms  retained  in  the 
summation  the  closer  the  resulting  performance  would  be  to  the  optimal  performanca 

Returning  to  the  filter  defined  by  (5.6)  another  approximation  will  be 
obtained.  First,  consider  the  Laplace  Transform  of  g2j^j(t)  given  by 


k 

u 


2 

+ “j 


+ 


2 

b:  Q., 



2 2 
(s  + 5 . ) + m . 

cj'  3 


Using  the  definitions  of  5ck»  % j » etc,  this  becomes 


k G„  . (s)  = k 
u 2kj  u 


b^  b 
k 


2 2 2.  2 ^ . 2 , 2 2 , 2.  ^ . 2,  2 , 4 2 ,4-.,.) 

.<(00,  - 0) . ) s + 2k  (m,  b . - u . b,  ) + k (m,  b . - m . b^  )/4; 
1 r k u k .1  .1  k^  u k ,1  ,i  1 I 


/ 2 2. 
(“k-<^j) 


^S+-y-j  +0)^ 


Dropping  all  terms  second  order  or  greater  in  k gives 


w2  k2  2 

\ b s 

Ic  Grt-I  • C®)  “ ^ V)'  r\  ' /J  ry  ry 

u 2kj  U/2.,,2.  2v,2.,,2 


2T‘ 


^ (s  + k^b^s+  Wj^)  (s  + k^bjS+  oij) 

2 2 

where  G|]^(s)  is  the  Laplace  Transform  of  gij^(t)  (neglecting  ?cs  the  denom- 

inator) . Thus  the  Laplace  Transform  of  the  compensator  is 


00  00 


+ K E E 


t.2  .2  2 

b,  b . s 
J 


\ E 2^.  u2  ^ 2 ' “"u^  , 2^^  , 2 ^ 2,  . 2^,  ,2  , 2, 

^ k=l  s +k  b,  s+m,  ^ k=l  j=l  (s  +k  b,  + o),  ) (s  +k  b . s + o) .) 

u k k u k k u j j 


(5.9) 


In  the  time  domain,  the  approximation  to  the  optimal  control  is  now  given  by 

cot  ““"tft  1 

u(t)  = -k  f W E E ( ](  (cf-T)da|y(r)dT 

^ k=l  •’o  ^ k=l  j=l  o (*^T  ' * 

k/j  (5.10) 

This  representation  is  interesting  in  that  the  basic  building  blocks  of  the  fil- 
ter are  the  positive  real  component  filter  gjk(t“'^)  • Note,  however,  that  the  con- 
volutions 
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’3kj 


(t,T)  = J g^^(t-a)  gjj(a-x) 


da 


or,  equivalently,  the  products 


are  not  positive  real.  However,  since  the  diagonal  terms  dominate  at  high  fre- 
quencies, the  high  frequency  component  filters  approach  a positive  real  form. 
Since  we  would  expect  a close  (finite  dimensional)  approximation  to  the  optimal 
filter  to  stabilize  the  system,  it  is  interesting  that  the  limiting  structure 
is  positive  real,  i.e.,  a structure  that  is  compatible  with  truncation. 


6.0  CONCLUSIONS 

It  has  been  shown  that  the  optimal  compensator  can  be  represented  in  terms 
of  a kernal  of  an  integral  operator.  The  results  demonstrate  that  the  integral 
representation  provides  considerable  Insight  into  the  structure  of  the  compen- 
sator. This  is  due  to  the  compensator  being  compact  and,  hence,  something  that 
is  inherently  approximatible  by  a finite  dimensional  operator,  i.e.,  the  compen- 
sator parameters  go  to  zero  as  k ->  <».  For  the  system  considered  here,  the  opti- 
mal filter  can  be  constructed  of  basic  building  blocks  which  are  second  order 
positive  real  filters.  The  first  (low  gain)  approximation  is  a diagonal  array 
of  these  component  filters  and  there  is  a one  to  one  correspondence  with  the 
open  loop  modes.  The  second  (low  gain)  approximation  adds  in  the  cross-coupling 
effects  in  the  form  of  fourth  order  filters  which  are  convolutions  between  the 
various  component  filters  taken  two  at  a time.  At  high  frequencies,  the  dynamics 
of  the  compensator  approach  the  diagonal  form  of  the  first  approximation  which 
should  have  advantages  when  the  filter  dynamics  are  truncated  in  order  to  obtain 
a finite  dimensional  approximation  for  the  compensator.  The  results  also  seem 
to  suggest  that  higher  gains  will  require  increasingly  higher  order  convolutions 
of  the  component  filters. 
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APPENDIX 

The  following  defines  0^(ku;t),  the  remainder  term  for  the  optimal  compen- 
sator, after  four  complete  iterations  and  part  of  a fifth  iteration; 
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ANALYSIS  OF  STRUCTURAL  PERTURBATIONS  IN 
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Abstract 

It  has  long  been  common  practice  to  analyze  linear  dynamic  systems  by  decomposing  the  total  response 
in  terms  of  individual  contributions  which  are  easier  to  analyze.  Examples  of  this  philosophy  include  the 
expansion  of  transfer  functions  using:  (i)  the  superposition  principle,  (ii)  residue  theory  and  partial 

fraction  expansions,  (iii)  Markov  parameters,  Hankel  matrices,  and  (iv)  regular  and  singular  perturbations. 
This  paper  summarizes  a new  and  different  kind  of  expansion  designed  to  decompose  the  norm  of  the  response 
vector  rather  than  the  response  vector  itself.  This  is  referred  to  as  "cost-decomposition"  of  the  system. 
The  notable  advantages  of  this  type  of  decomposition  are:  (a)  easy  application  to  multi-input,  multi- 

output systems,  (b)  natural  compatibility  with  Linear  Quadratic  Gaussian  Theory,  (c)  applicability  to  the 
analysis  of  more  general  types  of  structural  perturbations  involving  inputs,  outputs,  states,  parameters. 
Property  (c)  makes  the  method  suitable  for  problems  in  model  reduction,  measurement/actuatdr  selections, 
and  sensitivity  analysis. 

1.0  Introduction 

For  many  reasons  it  is  convenient  and  often  necessary  to  characterize  dynamic  system  behavior  in 
terms  of  an  expansion  of  the  system  response  into  terns  which  are  individually  easier  to  analyze  than  the 
total  response.  Forexample,  it  is  common  practice  in  the  study  of  linear  systems  to  take  advantage  of 
the  superposition  principle  to  separately  analyze  the  effect  of  each  input  u,  on  each  output  y.  of  the 
system,  ^ 


y^-(s) 


I G..(s)  u.(s) 
j = l ^ 


(1.1) 


or  to  analyze  the  contribution  of  each  mode  of  the  system  by  using  the  partial  fraction  expansion 


n R. 

y(s)  = G(s)  u(s)  = I g-  _ ■ u(s) 


(1.2) 


In  lieu  of  poles  and  residues  the  expansion  can  take  on  many  other  forms  including  the  use  of  Markov 
parameters,  Hankel  matrices,  and  various  other  moments.  In  time  domain  analysis,  y(t)  may  be  expanded 
in  the  power  series  form 


y(t)  = I e%^(t)  (1..3) 

n=0 

where  e is  a small  parameter  in  the  system  and  y (t),  n = 0,  1,  ...,  constitute  the  order  approxima- 
tions of  y(t)  in  perturbation  analysis.  The  reglllar  and  singular  perturbation  analysis  0 .3)  has  been 
developed  extensively  in  both  open-loop  and  closed-loop  problems  embracing  both  linear  and  nonlinear 
systems  [l]-[2].  In  the  presence  of  structural  perturbations  in  the  state  (singular  perturbations)  the 
expansions  of  the  form  (3)  can  be  readily  employed  to  predict  the  perturbed  behavior,  whereas  (1.2)  is 
useful  only  if  the  structural  perturbations  occur  in  modal  coordinates.  Other  types  of  structural 
perturbations  occur  if  the  number  of  inputs  or  outputs  change.  The  effects  of  these  input  or  output 
perturbations  can  be  approximated  by  expansion  (1.1)  or  (1.3). 

The  purpose  of  this  paper  is  to  investigate  a different  kind  of  expansion  of  the  response  y in  the 
analysis  of  dynamic  systems  subject  to  all  types  of  structural  perturbations;  state,  input,  output. 

This  expansion  is  called  a "cost  decomposition"  since  it  involves  expansion  of  a soalar  function  of  the 
output  y,  denoted  by  the  inner  product 


p ^ <y,y>  = I V.  (1.4) 

i = l ^ 

where  V-;  are  the  "component  costs"  denoting  the  contribution  of  the  i system  "component  in  the  overall 
performance  metric  t/.  The  motivation  for  the  decomposition  of  the  scalar  (1.4)  is  that 

(a)  the  overall  system  performance  metric  is  often  characterized  in  a quadratic  form  and 

the  expansion  (1.4)  allows  one  to  determine  the  relative  significance  of  the  components 
of  the  system.  This  relative  significance  is  easily  established  by  comparing  the 
scalars  l/j^  ^ V2 
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(b)  the  component  costs  provide  information  which  can  be  used  to  advantage  in 
these  problems: 

1.  Model  Reduction 

2.  Construction  of  Reduced  Controllers 

3.  Construction  of  Reduced  Estimators 

4.  Sensor/Actuator  Selection 

5.  Parameter  Sensitivity  Analysis  (determining  "critical"  parameters) 

This  paper  shows  how  to  exploit  cost  decomposition  methods  (4)  to  generate  suboptimal  solutions  to  problems 
1-4. 


The  paper  is  organized  as  follows.  Section  2.0  describes  cost  decomposition  procedures  for  determining 
"component  Costs,"  "input  costs,"  and  "output  costs."  Section  3.0  applies  these  ideas  to  model  reduction. 
Section  4.0  applies  these  ideas  to  controller  reduction.  Section  5.0  applies  these  ideas  to  sensor/actuator 
selection.  Some  concluding  remarks  appear  in  Section  6.0. 

All  of  the  ideas  in  this  paper  can  be  (some  have  been)  extended  to  time-varying  plants  with  finite 
time  performance  metrics.  However,  in  the  interest  of  brevity  only  the  stationary,  infinite  time  results 
will  be  presented. 

2.0  Cost-Decomposition 

Consider  the  linear  system 


Ax  + Dw 

XeR  , 

, wcR« 

(2.1) 

Cx 

yeR  ^ 

where  the  matrix  pair  (A,D)  is  controllable  and  the  matrix  pair  (A,C)  is  observable.  The  input  w(t)  is  a 
zero-mean  white  noise  with  intensity  W(Ew(t))  = 0,  Ew(t)wT(x)  = Ws(t-x)).  The  model  (2.1)  exists  for  the 
purpose  of  accurately  describing  the  outputs,  y, , ...,  y^  . Therefore,  a suitable  measure  by  which  to 
quantize  the  performance  of  (2.1)  is  ^ 

= lim  E||y||^  , ||y||^  = y^Qy  (2.2) 

t-HO  ^ ^ 

The  matrix  Q weights  the  outputs  according  to  their  importance  in  the  performance  criteria. 

n,- 

It  is  of  interest  to  know  how  sub-state  x.jeR  contributes  to  the  cost  function  I/. 

The  "ccmponent  aosts"  associated  with  components  x.j , i-1,  ....  n of  the  state  veotop  must  represent  the 
contribution  of  y^\  in  V in  such  a way  that  the  sum  of  all  the  "component  aosts"  equals  the  total  system 
cost  V. 


(2.3) 


For  linear  systems  (2.1)  with  quadratic  costs  (2.2)  this  qualifies  as  a component  cost  definition. 


This  definition  of  component  cost  reduces  to  the  calculation  [4] 

= tr[XC'''QC].^.  , [XcVlii-  eR 


n^xn^ 


0 = XA^  + AX  + DWD^ 


(2.4) 

(2.5a) 

(2.5b) 


where  the  "cost  decomposition"  property  (2.3)  holds. 

It  will  also  be  of  interest  in  Section  5 to  know  how  each  input  w.  and  each  out  y.  contribute  to  the 
cost  function  U. 

The  i?jput  costs  U- ^ {"ouvvut  costs"  1^.-°)  associated  with  w- , i=l,  ...»  i"l»  n ) must  represent 

i‘  1 1 "ijy 

the  contribution  o/ w^  (y^. ) in  V in  such  a way  that  the  sum  of  the  "input-costs"  ("output  costs"  l/^°) 
equals  the  total  system  cost  V. 
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I' 


(2.6) 


' ■ ,r.  v 


(2.7) 


For  linear  systems  (2.1)  with  quadratic  costs  (2.2)  this  qualifies  as  an  "input  cost,"  "output 

cost"  definition, 


I A ,^llyNQ  , 

t.x«  1 


A ..  _.3My|| 


."I  iJi  In 

lim  E( — ' ^ y.) 

3y..  ■'i 


i " ■'  3y. 

t.M>  •'i 


(2.8) 

(2.9) 


These  definitions  reduce  to  the  calculations 


= tr[o''’KDW].  . 

0 = KA  + a\  + C^QC 


l/.°  = [CXC'Q]^.^ 

0 = XA^  + AX  + DWD^ 


(2.10a) 

(2.10b) 

(2.11a) 

(2.11b) 


where  the  cost-decomposition  properties  (2.6),  (2.7)  of  V. I and  hold. 

The  component  cost  , the  input  cost  l/^I,andthe  output  cost  quantize  the  "value"  of  the  state 
x.j , the  input  w,,  or  the  output  y,  while  all  components,  inputs,  and  outputs  are  in-place  and  interacting. 
Indeed,  the  very  definition  of  a dynamic  system  is  described  in  terms  of  "interconnected  components  casually 
related  in  time."  It  is  only  natural,  then  to  describe  the  total  system  behavior  in  terms  of  the  contri- 
butions from  each  of  the  individual  components.  Component  Cost  Analysis  (CCA)  serves  this  purpose  and,  as 
such,  it  should  be  interpreted  as  a diagnostic  tool  to  gain  insight  into  system  behavior. 

If  > Vi  for  all  j^i,  then  the  i^^  component  is  the  critical  performance  limiting  component.  This 
component  might  be  redesigned  to  be  less  dominant  in  the  system  performance.  Such  a "cost-balancing" 
strategy  for  system  design  is  under  consideration.  The  critical  components  from  CCA  also  suggest  which 
components  should  be  made  more  reliable  for  minimizing  the  performance  degradation  in  the  presence  of  the 
"most-likely"  component  failure. 

The  next  Section  3.0  will  exploit  (2.5)  in  its  application  to  the  model  reduction  problem. 

3.0  Application  of  CCA  to  Model  Reduction 

In  Section  2.0  it  was  shown  that  represents  the  in  situ  value  of  x,  in  the  cost  V.  Component  costs 
(2.5)  may  be  computed  in  physical  or  any  other  coordinates.  If  a component  x<  is  deleted  from  (2.1),  the 
cost  V will  not  generally  be  perturbed  by  amount  Vi.  It  will  be  of  interest, therefore,  to  know  in  what 
coordinates  the  cost  perturbation  is  Vi.  In  other  words,  when  do  the  Vi  satisfy  the  "cost-superposition" 
property 


V 


R 


where  Vr  is  the  value  of  V with  n-r  components  deleted  from  the  system  (2.1). 
Definition  1: 


(3.1) 


Components  Xi , i=l,  ...,  n which  satisfy  the  cost-superposition  property  (3.1)  for  alt  1 i r _<  n are 
said  to  be  "cost-decoupled,  components,  " 

Cost-decoupled  components  are  of  obvious  interest  in  model  reduction  problems  since  the  exact 
perturbation  of  V resulting  from  the  truncation  of  n-r  components  can  be  predicted  a priori.  There  are 
many  cost-decoupled  cor^jonents.  Three  different  choices  will  be  discussed  in  what  follows. 


3.1  Modal  Cost  Analysis 


A "component"  Xi  is  called  a "coordinate"  when  ni=l.  Let  A be  diagonal  in  (2.1).  Then  x^,  n.j=l, 
i=l,  .. .,  n are  medal  coordinates.  The  corresponding  component  cost  analysis  (2.5)  is  then  referred  to 
as  Modal  Cost  Analysis  (MCA)  and  the  following  results,  are  known  [3]-[6]. 
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Theorem  1; 


If  A is  diagonal  in  (2.5)  and  if  eithev  (i),  (ii),  or  (iii)  hold 


(i)  d.*Wdj  = 0 , i j 

(ii)  C:|*Qcj  = 0 , i j 

(iii)  -Re  A.,- /I  m A.j  < e 


D - [dp  , d,.,^] 

C = [cp  . . . , 

e arbitrarily  small  > 0 


then 


Id, I 


'W 


-Re  X, 


(3.2) 


Theorem  2:  Modal  Cost  Analysis  of  Lightly  Damped  Systems 

If  the  system  of  interest  is  described  by  the  modal  coordinates  of  a matrix-second  order  system 


2 T 

iii  + M,  n,  = d,  w , i=l N 

N 

y = .1^  (P,n,  + p:^,) 

then  the  modal  costs  associated  with  the  overall  cost  (2.2)  ore 


t/. 

1 


{iip.ii; 


IbillgUl^illw 


1=1, 


(3.3) 


(3.4) 


T T T 

where  the  equality  holds  in  (3.4)  if  (1)  d,  Wdj  = 0,  i/j,  or  if  (ii)  p,  Qpj  + Pi ' QPj  = 0,  i/j,  or 
(iii)  in  the  limit  as  c,  ->■  0. 

The  formulas  (3.2)  or  (3.4)  are  useful  in  determining  critical  modes  of  the  special  class  of 
systems  satisfying  either  (i),  (ii),  or  (iii).  Otherwise,  numerical  solutions  of  (2.5)  are  required  to 
compute  the  modal  costs.  Both  coordinates  described  by  Theorems  1 and  2 are  "cost-decoupled"  in  the 
sense  of  Def.  1.  As  a result,  those  r modes  which  should  be  deleted  in  order  to  minimize  the  cost 
perturbation 


Al/  = V - (3.5) 

are  indicated  by  the  r smallest  values  of  the  modal  costs  1/p 

The  use  of  Modal  Cost  Analysis  for  reduced  order  modeling  of  large  space  structures  is  discussed  in 
[7],  [8].  The  use  of  Modal  Cost  Analysis  for  diagnosing  matrix-second-order  systems  is  presented  in  [9]. 
It  is  clear  from  (3.2)  and  (3.4)  that  the  modal  cost  I/,  is  the  product  of  three  properties  of  a mode: 
the  weighted  observability  ||ci||q,  the  weighted  disturbability  ||d-j||^,  and  the  time  constant  -1/ReX,. 

3.2  Cost-Equivalent  Realizations 

Another  important  set  of  coordinates  which  possess  the  cost-superposition  property  (3.1)  is  presented 
in  this  section.  In  this  set  of  coordinates,  derived  in  [10],  the  matrix  XC^QC  is  diagonal.  Hence,  in 
these  coordinates 


xdqc 


V 


1 


0 


(3.6) 


Theorem  3:  Coordinates  which  satisfy  (3.6)  are  ''cost-decoupled"  in  the  sense  of  definition  1. 

Now  in  order  to  reduce  the  model,  (2.1)  so  as  to  make  the  smallest  possible  perturbation  in  the  cost  I/,  one 
can  rely  on  Theorem  3 to  obtain  the  following  result,  [4]. 

Theorem  4: 

Suppose  the  coordinates  of  (2.1)  are  arranged  so  that  (3.6)  holds  with  1/,  rl  ^2  i • • • i ^n  • 
reduced  model  of  order  r which  minimizes  (3.5)  is  obtained  by  deleting  the  last  nx-r  states  in^{2.\), 
and  the  resulting  cost  perturbation  is  equal  to 


X 

= I I'i 

i=r+l  ^ 
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Al/ 


(3.7) 


Assume  that  C in  (2.1)  has  rank  [C]  = n^,  and  that  Q > 0,  W > 0.  Then,  by  assumption  of  control- 
lability of  (A,D),  (2.5b)  has  a positive  definite  solution  X > 0.  The  assumption  of  stable  A is  added  to 
ensure  that  X is  bounded  from  above.  (However,  CCA  is  not  limited  to  the  analysis  of  stable  systems, 
[12]).  Under  these  conditions,  the  matrix  XCTqc  in  (3.6)  has  rank  in  ny.  Hence,  n^-ny  of  the  in 
(3.6)  are  zero.  This  leads  to  the  following  result,  [10]. 

Corollary  to  Theorem  4; 

Let 


Vr  *^r“ 


^R  ''r’^R 


(3.8) 


vepveBent  the  redueed  order  model  obtained  by  truncating  the  last  Oj^-r  states  of  (2.1).  Then  if  A|^  is 
stable,  (3.8)  is  a "cost-equivalent"  realization  of  (2.1)  (in  the  sense  that  Al/  = 0)  if  r ^ Oy. 

It  remains  to  be  shown  whether  Ap  is  stable.  For  this  task  a particular  set  of  coordinates  are 
described  which  yield  (3.6).  Define  n by  the  Cholesky  square  root  of  X°, 

X°  = (3.9) 


and  e as  the  orthonormal  matrix  of  eigenvectors  of  o^C°^QC°fi.  Hence, the  transformation  of  the  original 
states  x°  having  parameters  {A°,  C°,  D“,  X°1  by  x°  = nex  yields 

X = = I 


C^QC  = e^si^  C“^QC°oe  = 


Hence,  (3.6)  is  satisfied  and  the  states  x.j  qualify  as  cost-decoupled  coordinates.  In  these  coordinates 
the  last  n^-r  states  are  deleted  to  form  the  reduced  order  model  (3.8).  The  following  is  known  concerning 
the  cost-decoupled  coordinates  described  by  (3.10),  (3.11)  and  the  resulting  reduced  order  model  (3.8)  of 
order  r. 

Theorem  5: 

The  cost-decoupled  coordinates  described  by  (3.10),  (3. 11), and  the  reduced  model  (3.8)  have  the 
following  properties  [12]. 

(a)  ^ 0 for  all  i=l , . . . , n 

(b)  ReXi[Ap]  £ 0 r ^ ny 

(c)  ReX.j[Ap]  < 0 controllable 

(d)  Al/  = I I/. 

i=r+l  ^ 

(e)  Al/  fs  minimized  by  (3.8)  for  any  given  r 

(f)  Al/  = 0 i/  r ^ n^  and  (c)  holds 

(g)  lini  E y(t)  y^(t)  = lim  Ey^(t)yJ(t). 

t-X» 

Property  (f)  gives  the  conditions  under  which  (3.8)  is  a "cost-equivalent  realization"  (CER)  of  (2.1). 
This  result  shows  that  Kalman's  minimal  realization  is  not  necessarily  minimal  with  respect  to  cost 
functions.  That  is,  if  the  pair  (A,D)  is  controllable  and  (A,C)  is  disturbable  then  (2.1)  is  minimal  in 
the  sense  of  Kalman.  However,  (3.8)  can  be  a lower  order  equivalent  realization  of  (2.1)  in  the  sense  of 
equivalent  cost  V = l/p.  According  to  the  examples  given  in  [4],  the  order  of  CER  (3.8)  is  usually  r=n  . 
There  is  a known  exception,  however.  ^ 

Theorem  6 [12]: 

Th.e  CER  is  of  order  r > Oy  if  the  first  Markov  parameter  of  (2.1)  is  zero  (CD=0). 
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(3.10) 


(3.11) 


A "stochastic  equivalent  realization"  (SER)  is  defined  in  [13]  as  any  realization  which  matches  the 
first  two  moments  of  the  output  y(t); 

Ey(t)  = EyR(t)  , Ey(t)y’^(t)  = EyR(t)yJ(t).  (3.12) 

The  CER  has  been  compared  to  the  SER  of  Anderson.  The  significant  results  are  these. 

Theorem  7 : 


Let 


G(S)  = 

describe  the  transfer  matrix  for  (2.1)  and  let 

C(sI-A)'^D 

(3.13) 

Gr(s)  = 

Cr(sI-Ar) 

(3.14) 

describe  the  transfer  matrix  for  (3.8).  Eq.  (3.8)  describes  an  SER  of  (2.1)  if  G(s)  can  be  factored  into 
the  form 


G(s)  = Gp(s)  U(s)  (3.15) 

where  U(s)  describes  an  "all-pass"  network 

U(joo)  WU*(jio)  = W (3.16) 

and  any  SER  is  also  a CER.  For  all  single-input  single-output  systems  the  minimal  order  SER  and  CER  are 
the  same. 


The  SER  can  be  of  lower  order  than  Kalman's  minimal  realization,  and  the  CER  can  be  of  lower  order 
than  the  SER  (except  for  single-input,  single-output  systems).  The  price  paid  for  this  advantage  is  that 
the  CER  matches  only  the  cost  (1/  = 1/r)  and  steady  state  covariance  (property  (g)  of  Theorem  5),  whereas 
the  SER  matches  the  cost  (t^  = l/p)  and  the  covariance  for  all  time  (3.12). 

The  weighting  matrix  Q in  (2.2)  often  contains  some  arbitrariness.  It  is  appropriate  to  ask, 
therefore,  whether  the  component  cost  l/i  is  highly  sensitive  to  the  choice  of  Q or  the  choice  of  C in 
(2.1),  (2.2).  This  sensitivity  result  is  summarized  as  follows. 

Theorem  8: 

3l/.- 

There  is  no  choice  of  coordinates  which  yields  a smaller  value  of  the  sensitivity  | | — j — | [ , 
than  the  cost-decoupled  coordinates  satisfying  (3.10),  (3.11).  3C  QC 

Proof: 


The  proof  relies  on  a result  from  [14]  which  states  that 


3X. 

larllii 


if  x^.  is  an  eigenvalue  of  ii>  and  that 


3X. 

lairl 


= 1 


if  t|;  = 1)1''^.  The  proof  is  concluded  by  noting  from  (3.11)  that  I/,  is  the  i^^  eigenvalue  of  C^QC  and  that 
cTqc  is  symmetric. 


It  may  be  observed  that  the  entire  set  of  cost  decoupled  coordinates  as  defined  by  (3.6)  does  not 
have  the  minimum  sensitivity  result  of  Theorem  8,  since  (a)  l/j  is  an  eigenvalue  of  XCTqC  in  general,  and 
(b)  l/i  is  an  eigenvalue  of  XCTqc  which  is  not  generally  symmetric,  as  it  is  with  the  choice  of  (3.10), 
(3.1lj. 

The  cost-decoupled  coordinates  defined  by  (3.10),  (3.11)  are  not  unique  when  ny  “ rank  [C]  < n.  In 
this  (usual)  case  there  is  some  degree  of  flexibility  in  the  choice  of  n-k  coordinates.  The  coordinates 
are  arranged  in  [15]  to  place  the  states  x^-,  i > k in  order  of  their  degree  of  observability  in  y.  (It 
has  already  been  established  from  Theorem  4 that  the  CER  truncates  some  observable  subspaces,)  The  details 
are  described  in  [15],  and  the  resulting  form  of  the  modified  cost-decoupled  coordinates  is 
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(3.17) 


y = [Cj 


where 


eR 


HyXHy 


, and 


0 0 0]  X 


1 1^12!  I - 1 1*23!  I - •••  - I I 


(3.18) 


The  representation  (3.17)  is  similar  in  structure  to  the  Generalized  Hessenberg  Representation  (GHR)  of 
[16]  with  these  notable  differenoes: 

(i)  Cj  in  (3.17)  is  not  identity 

(ii)  the  transformations  leading  to  (3.17)  involve  the  matrices 
a,  0 in  (3.10),  (3.11),  as  described  in  [15]. 

Thus,  if  a CER  is  desired  of  any  Specified  order  r ^ ny,  the  choice  for  r=ny  does  not  involve  the  modified 
representation  (3.17),  whereas  for  the  choice  r > ny  the  representation  (3.17)  is  useful.  Iri  this  later 
case,  the  CER  will  have  one  property  in  addition  to^cost  equivalence  {V  = 1/r)  : it  will  retain  those 

substates  {x-j,  i>k)  which,  among  the  zero  cost  states  (l/^-  = 0),  are  most  observable  in  the  sense  of  (3.18). 

4.0  Controller  Reduction 

Controller  reduction  using  Component  Cost  Analysis  (CCA)  is  based  upon  participation  of  the  optimal 
controller  states  in  the  cost  function 


V = 


(4.1) 


The  optimal  controller  for 

X = Ax  + Bu  + Dw 

y = Cx 

z = Mx  + V , Ev  = 0,  Ev(t)^(T)  = V6(t-T) 

is  described  by 

X = (A  + BG  - FM)  X + Fz 

u = Gx 


(4.2) 


(4.3a) 


where 

(4.3b) 
(4.3c) 

Now  delete  a subset  of  the  states  of  (4.2)  to  obtain  the  reduced  controller  of  dimension  r 

Xp  = (A  + BG  - FM)^Xp  + FpZ  , (A  + BG  - FM)^ 

“r  " ‘"R  ’‘r 


(Ap  + BpGp  - F|^Hp) 

(4.4) 


G = -R'^B^K  , 0 = KA  + A^K  - KBR'^b\  + C^QC  , 
F = PM^V^  , 0 = PA^  + AP  - PM^V'^MP  + DWD^  . 
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where  matrices  with  subscript, R have  a specified  set  of^rows  and/or  columns  deleted,  corresponding  to  the 
deleted  states,  x..  To  determine  the  participation  of  x.  in  (4.3a)  consider  that  the  plant  (2.1)  and  the 
controller  (4.3)  together  form  the  closed  loop  optimal  System  with  cost  1/  given  by 


where  X satisfies 


P = P + 1/ 

y u 

I'y  - lim  E||y|ln  = EilCxllo 

t-W  ^ t-X»  ^ 

= lim  E|  |u|  ip  = lim  E|  |Gx| 

t-HO 

0 = X(A+BG)^  + (A+BG)X  + FVF^ 


(4.5a) 

tr[XC^QC] 

(4.5b) 

tr[XG^RG]  , 

(4.5c) 

(4.5d) 

Since  only  controller  states  x.j  (and  not  plant  states  x,j)  are  considered  for  truncation  in  the  controller 
reduction  problem,  the  controller-cost  decoupled  (CCD)  coordinates  are  defined  as  follows. 


Definition  2: 

Ccntroller-oost-deooupled  (CCD)  coordinates  are  defined  by  the  cost-superposition  property  (3.1) 
for  the  system 


X = (A+BG)  X + Fv 
u = Gx 


(4.6) 


As  a result  of  this  definition,  (4.5c),  (4.5d),and  (4.6)  reveal  that  CCD  coordinates  have  the 
property  ‘ 


[XG^RG] 


¥ 


1 


•0 


mj  - rank  G 


(4.7) 


The  coordinates  satisfying  (4.7)  are  not  unique  and  for  reasons  similar  to  Section  3,  (4.7)  will  be 
satisfied  by  the  choice 


G^RG 


■ 0 


(4.8) 


(4.9) 


Hence, these  properties  hold  for  the  CCD  coordinates  of  (4.6). 
Theorem  9: 


(a)  The  reduced  order  system  of  order  r 


'^R  ° ^^R'^^R^r'  ^R  * ''r'' 
“r  ^ ‘"r^'r 


(4.10) 


A 2 2 

is  a cost-equivalent  realization  of  (4.6)  (i'y  = ^I  I^rNr  “ '^Up*  vthere  l/y  is  calculated 

from  (4.6)  and  from  (4.10)),  if  r^mj^  and  (Ap+BpGp)  is  stable. 

In  addition,  (4.6)  and  (4.10)  have  these  properties: 

(b)  t/i(x)>0 

(c)  ReAi[Ap+BpGp]  _<  0 f / r ^ m^ 

(d)  Re,\,j [Ap+BpGp]  < 0 <=>  (Ap+BpGp,  Fp)  is  controllable 
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(e)  aV  - 1/  V is  minimised  by  (4.10)  for 

u u Up 

(f) 


ly  gvoen  r. 

lim  E u(t)  u^(t)  = lim  E Un(t)ul(t)  if  (d)  holds.  (In  this  case  the  reduced  system  (4.10) 
4-  4.  K K 


t-X» 

an  Energy  Equivalent  of  (4.6).  1 


(g)  If  the  first  Markov  parameter  of  (4.6)  is  zero  (GF  = 0),  property  (f)  holds  only  if  r > m 

(h) 


r 


3l^,-(x) 

I 1 I 


i T 

3G'RG 


is  minimized  by  the  ohoioe  of  coordinates  (4.8),  (4.9). 


Proof : 

By  the  substitution  of 


A 

0 

C 

X 

Q 

y 


A+BG 

F 

G 

X 

R 

u 


(4.11) 


all  of  the  results  of  Section  3 as  applied  to  (2.1)  and  (3.8)  now  apply  to  (4.6)  and  (4.10).  Hence, 
Theorems  5,  6,  and  8 readily  yield  the  above  results  (a)-(h)  by  use  of  the  dual  (4.11). 


The  next  logical  step  is  to  put  (4.10)  in  the  modified  controller  cost-decoupled  coordinates  suggested 
by  (3.17).  These  details  need  not  be  repeated  since  the  dual  (4.11)  has  been  established.  The  modified 
controller-cost-decoupled  coordinates  to  be  used,  therefore,  employ  the  Hessenberg-type  structure  (3.17) 
appl  ied  to  (4. 10) . 

The  important  conclusion  concerning  component  cost  analysis  is  applied  to  open  or  closed  loop  systems 
is  this;  even  though  a realization  (A,D,C)  may  be  minimal  in  the  sense  of  Kalman,  there  may  still  be 
states  xi  in  the  realization  which  do  not  contribute  to  the  cost  function  V.  The  theory  of  cost  equivalent 
realizations  (CERs)  identifies  those  states. 


5.0  Input/Output  Perturbations 

The  input  (output)  costs  ^1^(1/^°)  of  Section  2 describe  the  contribution  of  the  i'"^  input  w,  (the  i^^ 
output  y.()  in  the  performance  matrix  (2.2).  The  following  are  properties  of  and  P.j°. 


Theorem  10:  Input/Output  Cost  Analysis  [17],  [18] 

For  the  system  described  by  (2.1),  (2.2)  with  input  (output)  costs  P.^(P.°)  given  by  (2.9)-(2. 11) : 
"w  "y 

1.  i/=  I 1/  > ■ = I I/  O 

i=l  ’ i=l  ’ 

2.  I/.  ^ 0 if  W^.  j 0 for  all  j=l , . . . , m 

3.  l/^.°  ^ 0 if  ^ 0 for  all  j=l,  ...,  k 

4.  l/^.  I > 0 if  (A,C)  is  observable  and  (2)  holds. 


5.  1/^.0  > 0 if  (A,D)  is  controllable  and  (3)  holds. 

6.  The  input  cost  l/.j  and  the  output  cost  l/.j°  are  invariant  under  state  transformation 

M. 

7.  Tfie  actual  perturbation  of  the  cost  V due  to  the  deletion  of  w^  is 


^‘'i'  = 21/^  - d*Kd.W.. 


and  - l/^.  ^ if  W is  diagonal. 


8.  The  actual  perturbation  of  the  cost  V due  to  the  deletion  of  y. 


Al/i»  = 21/.°  -c.  Xc.Q.. 


and  Al/^. ° = I/. ° if  Q is  diagonal. 

Theorem  10  suggests  the  solution  to  the  following  input  (output)  selecting  problem. 
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The  Input  (output)  Selection  Problem: 

From  an  admissible  set  of  m inputs  (k  outputs)  in  (2.1)  find  the  set  o/  m < m inputs  (k  < k outputs) 
ahioh  cause  the  smallest  perturbation  in  the  cost  V in  (2.2). 

The  solution  to  this  problem  is  obtained  by  the  deletion  of  those  m - lii  inputs  (k  - k outputs)  having 
the  smallest  in  Theorem  10.  In  [17]  it  was  shown  that  this  approach  turns  out  to  be  identical 

to  the  results  of  the  integer  programming  approach  [19]  for  solving  the  output  selection  problem. 

The  closed-loop  application  of  the  input/output  cost  analysis  for  actuator/sensor  selection  is 
treated  in  [18],  [20],  and. in  the  interest  of  brevity  these  results  are  not  summarized  here.  The 
distinctions  to  be  noted,  however,  are  that  the  closed  loop  ICA/OCA  algorithm  is  iterative,  unlike  the 
open-loop  version  suggested  by  Theorem  10.  The  required  iteration  is  due  to  the  fact  that  the  selection 
of  inputs  and  outputs  changes  the  closed-loop  plant  matrix,  unlike  the  open  loop  version  (2.1),  (2.2). 
Further  research  is  required  to  describe  bounds  on  cost  perturbations  in  the  closed-loop  situation. 

6.0  Conclusions 

Concepts  of  cost-decomposition  are  described  to  treat  problems  where  structural  perturbations  in 
inputs,  outputs,  and  states  may  occur. 

The  input-cost  analysis  (ICA)  and  output  cost  analysis  (OCA)  determine  the  inputs/outputs  making  the 
largest  contribution  in  a quadratic  cost  function  of  the  outputs.  The  component  cost  analysis  (CCA) 
determines  the  states  making  the  largest  contribution  in  a quadratic  cost  function  of  the  outputs.  In 
this  setting  it  is  shown  that  even  though  a realization  (A,B,C)  may  be  minimal  in  the  sense  of  Kalman, 
it  may  contain  states  which  make  no  contribution  to  the  quadratic  cost  function  of  the  outputs.  A theory 
of  cost-equivalent  realization  (CER)  is  developed  to  find  and  delete  those  innocuous  states  making  no 
contribution.  In  the  open-loop  the  CER  theory  is  used  for  model  reduction.  In  the  closed-loop  the  CER 
theory  is  used  for  controller  reduction. 


References 

[1]  Saksena,  V.P.,  and  Kokotovic,  P.V.,  "Singular  Perturbations  in  Control  Theory",  Bibliography 

prepared  for  IFAC  Workshop  on  Singular  Perturbations  and  Robustness  of  Control  Systems,  Ohrid, 
Yugoslavia,  July  1982.  "" 

[2]  Kokotovic,  P.V.,  O'Malley,  R.E.,  Jr.,  and  Sannuti,  P.,  "Singular  Perturbations  and  Order 
Reductions  in  Control  Theory  - An  Overview",  Automatica  12,  pp.  123-132,  1976. 

[3]  Skelton,  R.E.,  "Cost  Decomposition  of  Linear  Systems  with  Application  to  Model  Reduction",  Int. 
J . Control , vol . 32,  no.  6,  pp.  1031-1055,  1980. 


[4]  Skelton,  R.E.,  and  Yousuff,  A.,  "Component  Analysis  of  Large  Scale  Systems",  Control  and  Dynamic 
Systems,  Academic  Press,  vol.  18,  1982,  ed.  C.T.  Leondes. 

[5]  Skelton,  R.E.,and  Gregory,  C.Z.,  "Measurement  Feedback  and  Model  Reduction  by  Modal  Cost  Analysis", 
JACC,  Denver,  1979. 

[6]  Skelton,  R.E.,  "Cost  Sensitive  Model  Reduction  for  Control  Design",  AIAA  Guidance  and  Control  Conf., 

Palo  Alto,  Ca.  1978.  

[7]  Skelton,  R.E.,  "Control  Design  of  Flexible  Spacecraft",  Agardograph  No.  251  Theory  and  Applications 
of  Optimal  Control  in  Aerospace  Systems,  ed.  I.P.  Kant,  1981.  Distributed  by  NASA  Langley  Va.  23365, 
attn : Report  Distribution  and  Storage  Unit. 

[8]  Skelton,  R.E.,  Hughes,  P.C.,  and  Hablani,  H.,  "Order  Reduction  for  Models  of  Space  Structures  Using 
Modal  Cost  Analysis",  AIAA  J.  Guidance  and  Control,  to  appear  1982. 

[9]  Skelton,  R.E.,  and  Hughes,  P.C.,  "Modal  Cost  Analysis  for  Linear  Matrix-Second-Order  Systems", 

J.  Dynamic  Systems,  Measurement,  and  Control.  Sept.  1980,  yol . 102. 

[10]  Skelton,  R.E.,  "Component  Cost  Analysis  of  Linear  Systems",  IEEE  ISCAS  Proceedings.  April  1981. 
Chicago. 

[11]  Skelton,  R.E.,  "Partial  Realizations  of  Linear  Stochastic  Time-Varying  Systems",  1980  Allerton 
Conf.  on  Computing  and  Control.  Monticello,  II. 

[12]  Yousuff,  A.,  and  Skelton,  R.E.,  "Cost  Equivalent  Realizations  of  Stochastic  Processes". 

[13]  Anderson,  B.D.O.,  "The  Inverse  Problem  of  Stationary  Covariance  Generation",  J.  Stat.  Phys.,  vol. 

1,  no.  1,  1969.  — 

[14]  Skelton,  R.E.,  "Root  Sensitivity  in  Linear  Systems  Analysis  and  Design",  submitted  for  publication.. 

[15]  Yousuff,  A.,  and  Skelton,  R.E.,  "Controller  Reduction  by  Component  Cost  Analysis",  submitted  for 
publication. 

[16]  Tse,  E.C.Y.,  Medanic,  J.W.,  and  Perkins,  W.R.,  "Generalized  Hessenberg  Transformations  for  Reduced 
Order  Modeling  of  Large  Scale  Systems",  Int.  J.  Control . vol.  27,  no.  4,  1978. 


[17]  Chiu,  Di.,  and  Skelton,  R.E.,  "Selecting  Measurements  and  Controls  in  LQG  Problems",  IEEE  CDC,  Dec. 
1981,  San  Diego. 

[18]  Chiu,  Di.,  "Optimal  Sensor/Actuator  Selection  Number,  and  Placement  for  Linear  Stochastic  Systems", 
Purdue  University  Technical  Report  prepared  under  Contract  No.  955369  for  the  Jet  Propulsion  Lab., 
May  1981. 

[19]  Chen,  W.H.,  and  Seinfeld,  J.H.,  "Optimal  Location  of  Process  Measurements",  Int.  J.  Control,  1975, 
vol . 21 , no.  6. 

[20]  Skelton,  R.E.,  and  DeLorenzo,  M.,  "Selection  of  Sensors  and  Actuators  in  an  LQG  Program",  to  appear 
J.  Large  Scale  Systems. 


475 


This  Page  Intentionally  Left  Blank 


FINITE  DIFFERENCE  NUMERICAL  METHODS  FOR 
BOUNDARY  CONTROL  PROBLEMS  GOVERNED  BY 
HYPERBOLIC  PARTIAL  DIFFERENTIAL  EQUATIONS* 

Gooing  Chen,  Quan  Zheng,  Matthew  Coleman,  and  Sunethra  Weerakoon,  Department  of 
Mathematics,  Pennsylvania  State  University 
University  Park,  PA  16802 


ABSTRACT 


This  pape;r  briefly  reviews  convergent  finite  difference  schemes  for  hyper- 
bolic initial  boundary  value  problems  and  their  applications  to  boundary  control 
systems  of  hyperbolic  type  which  arise  in  the  modelling  of  vibrations.  We  show 
how  these  difference  schemes  are  combined  with  the  primal  and  the  dual  approach- 
es to  compute  the  optimal  control  in  the  unconstrained  case,  as  well  as  the  case 
when  the  control  is  subject  to  inequality  constraints.  Some  of  our  preliminary 
numerical  results  are  also  presented. 


§1.  Introduction 

Many  problems  in  controlled  vibration  can  be  modelled  by  a first  order 
hyperbolic  partial  differential  system  of  the  following  type 


(1.1) 


^■^y(x,t)  =A-^y(x,t)  +By(t)  +f(x,t),  0SxSl,0<t5T, 

y(x,0)  = YqCx)  , OS  X S 1, 

Tr 

where  the  state  y(x,t)  = (y^^Cx, t)  , . . . ,y  (x,t))  is  a vector  function  of  (x,t) 
and  without  loss  of  generality  it  is  assumed  that  A is  a constant  diagonal  matrix 
of  the  form 


(1.2)  A 


A 0 


0 A 


+ 


, A = 
n X n 


0 


, A 


£+1 


(n-il)x(n-jl) 


-•ixi 


a^  - a. 


5 a • 
n 


B = [b  ,,  1 is  cin  n X n constant  matrix.  Associated  with  the  decomposition  of  A, 

:ik 


Supported  in  part  by  National  Science  Foundation  Grant  MCS  81-01892 
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B can  be  written  as 

B = 


The  boundary  conditions  and  control  are 

(1.3)  y (0,t)  = S^y"''(0,t),  y"'’(l,t)  = S^y  (l,t)  + Cu(t)  , u E control, 

where  y = (y^j...,yj^)  and  y = » • • • defined  according  to  the 

partition  of  y and  S^,  S^  and  C are  constant  matrices  of  size  I x (n-Jl)  , (n-£) 

X a,  (n-i)  X m,  respectively.  We  wish  to  minimize  a quadratic  cost 

(1.4)  J(y,u)  E/q  [/q  <y(x,t)  , W(x)y(x,t)>dx  + <u(t)  , ]N(t)u(t)>]dt, 

where  the  matrix  ]N  is  positive  definite,  and  IW  is  positive  semi-definite.  The 
above  cost  is  associated  with  the  vibration  energy  and  the  control  cost  of  the 
system.  Quadratic  regulator  problems  of  this  and  similar  type  have  been  consid- 
ered in  [1],  [4],  wherein  the  main  emphasis  is  on  the  synthesis  of  optimal  con- 
trols and  existence  proofs  of  solutions  of  Riccati  equations. 

Numerical  analysis  of  optimal  control  problems  governed  by  ordinary  differ- 
ential equations  has  been  very  well  studied.  Nevertheless,  for  control  systems 
governed  by  hyperbolic  partial  differential  equations,  there  has  been,  to  our 
knowledge,  very  little  systematic  Investigation  on  the  numerical  methods  for 
such  problems.  In  this  presentation  we  wish  to  make  a preliminary  study  of  this 
topic . 

§2.  Smoothness  of  State  and  Control 

Before  one  conducts  any  numerical  study,  it  is  important  to  know  the  answers 
to  the  following  basic  questions. 

(Ql)  In  what  sense  is  the  partial  differential  equation  (1.1),  (1.3)  well-posed? 

(Q2)  Does  the  optimal  regulator  (1.4)  subject  to  (1.1)  and  (1.3)  have  a unique 
optimal  control  u? 

(Q3)  How  smooth  is  the  control  u and  the  corresponding  state  y? 

(Ql)  can  be  answered  in  terms  of  the  following  theorem  [2,  Theorem  2.3] 
Theorem  2.1 

For  any  f € L^ ( (0 , 1) x (0 ,T) ) , y € L^(0,1)  and  u € L^(0,T),  the  solution  y 

of  (1.1),  (1.3)  exists  uniquely  in  L ( (0,l)x (0,T) ) . Furthermore,  there  exists 
K > 0,  independent  of  f,  y^  and  y,  such  that 

(2.1)  ||y||%  5 K(|ly  li^  +||u||2  + ||f||2  ).  □ 

L ((0,l)x(0,T))  ^ L^(0,1)  L (0,T)  L ( (0 , l)x (0 ,T) ) 
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Using  Theorem  2.1,  Duhamel's  formula  and  the  standard  primal  theory  (cf. 
[5],  e.g.) , we  easily  answer  (Q2)  in 

Theorem  2 ■ 2 

2 2 

Assume  that  f € L ( (0 , 1) x (0 ,T) ) , y^  € L (0,1).  Then  the  optimal  regulator 
problem  (1.4),  (1.1)  and  (1.3)  has  a unique  optimal  control  u.  □ 

In  fact,  [2,  Theorem  2.3]  gives  an  a priori  bound  which  is  even  stronger 
than  (2.1)  and  can  probably  be  used  to  prove  existence  of  optimal  controls  on 
[0,“). 


Theorems  2.1. and  2.2  only  guarantee  the  minimal  regularity  of  the  optimal  u 
and  the  corresponding  state  y.  Assume  that  the  inhomogeneous  forcing  term  f in 

(1.1)  is  C^  continuous  in  [0,l]x[0,T].  One  can  solve  (1.1)  by  the  method  of 
characteristics.  In  fact,  the  k-th  component  y^  of  y satisfies  a first  order 
linear  ordinary  differential  equation 

(2.2)  ^ .t)  = E b^y  (x^(t),t)  + f^(x^^(t)  , t)  , k=l,2,...,n, 

j=l  ^ ^ 

along  the  characteristic  curve  x = ^^(t) > along  which  is  satisfied 


(2.3) 


x^(t)  — k— 1 , 2 , . . . ,n. 


where  f in  (2.2)  is  the  k-th  component  of  f. 

K. 

Assume  now  that  the  initial  condition  y^  is  piecewise  smooth  on  [0,1]  and 
u is  also  piecewise  smooth  on  [0,T].  One  can  solve  y by  converting  (2.3)  into 
a system  of  Integral  equations.  Such  a system  has  a solution  y (as  the  fixed 
point  of  an  integral  operator)  which  is  easily  seen  to  be  also  piecewise  smooth 
in  [0,l)x[0,T].  Since  y depends  linearly  on  f»yQ  and  u,  we  write 


(2.4)  y = L(f ,Yq,u) , 

where  L is  an  integral  operator.  Substituting  (2.4)  into  (1.4)  and  minimizing 
with  respect  to  u,  we  get  another  new  integral  equation  for  u.  This  equation 
has  a unique  solution  u which  is  easily  seen  to  be  piecewise  smooth  on  [0,T] . 
Thus  we  state 

Theorem  2.3 

Assume  that  f S C^( [0,1] x[0,T])  and  y^  is  piecewise  smooth  on  [0,1].  Then 
the  optimal  control  u of  (1.4)  is  piecewise  smooth  on  [0,T].  ° 

oo 

In  Theorem  2.3,  even  if  y^  is  assumed  to  be  in  C ([0,1]),  the  smoothness  of 
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of  u will  not  necessarily  be  improved  to  C ([0,1]).  For  u to  be  C ([0,T]), 
compatibility  conditions  at  (x,t)  = (0,0)  and  (x,t)  = (1,0)  must  be  met: 
necessarily. 


(2.5) 


Yo(0)  = S^yJ(O).  yj(l)  = S2yQ(l)  + Cu(0). 


A X X 

For  u to  be  C ([0,T]),  one  must  further  have  y^  € C ([0,1])  and 

- ^^0  - - - + - + "^^0  + - + + 


(2.6) 

A+ 


+ f (0,0)] 

I *^y 

(1)  + B+y“(l)  + B+yJ(0)  + f^(l,0)  = S2[A~  ^ (1)  + B"y'(l)  + B^yJ(l) 


+ f (1,0)]  + C ^ (0)  . 


In  practice,  we  find  that  u ? C ([0,T])  is  probably  the  best  we  can  get. 
Thus  if  (2.5)  is  satisfied,  the  corresponding  state  y will  be  c0( [0 , 1 ] x [0 ,T ] ) 
smooth,  but  not  C^ ( [0 , 1 ] x [ 0 ,T] ) . Thus,  discontinuities  of  first  order  partial 
derivatives  of  y will  propagate  along  characteristics  (2.3). 


§3.  Finite  Difference  Approximations 

The  main  objective  of  this  section  Is  to  Introduce  several  commonly  used 
stable  finite  difference  schemes  and  different  ways  of  defining  boundary  condi- 
tions . 


In  computing  the  optimal  control  and  the  governing  partial  differential 
equation,  one  introduces  a time-step  k > 0,  a spatial  mesh  width  h > 0 by 
dividing  the  [0,1]  interval  into  ^ segments,  then  approximates  the  partial 

differential  equation  (1.1)  by  a consistent  single  or  multi-step  difference 
scheme 


(3.1) 


(I  + Q^)w^(t+k)  = (I  + Q^)w^(t)  + kf^(t),  t 
w(x^,0)  = Wq(x^),  V = -r+l,-r+2 , . . . ,N+p-l, 


pk , P-  — 0,1,2,...,  J 

V = -r+1 , -r+2 , . . . ,N+p-l 


where  the  notations  [2,3] 


X = vh 

V 


w^(t)  =w(x^,t),  f^(t)  = f(x^,t), 

P i -i  1 i 

Q,  s 2 A.E-^  , i=0,l;  A.  5 A.(h,k)  are  constant  n x n matrices; 

^ j=-r  ^ 31 
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Ew^  s i.e.,  E is  the  right  translation  operator; 


are  adopted.  The  initial  condition  in  (3.1.2)  is  determined  by  (3.1.2)  and 
the  equation  (1.1),  and  the  boundary  conditions  (1.3)  are  discretized  into 


(3.2) 


w (t+k)  = (t)  , 

w^(t+k)  = R^^^wj^_j^(t), 


p =-r+l , . . . , 0 , 
p = N,N+1, . . . ,N+p-l, 


where 


r(H) 


q 

z 


j=o 


J 


We  define  the  central,  forward  and  backward  difference  operators,  respectively, 
as 


The  most  often  used  finite  difference  schemes  are  listed  below: 

(L-W)  Lax-Wendrof f : 

,2 

y(x,t+k)  = y(x,t)  + k(AD^+B)y (x, t)  + -y  ^ D_^D_y(x,t)  + kf(x,t); 

(L.F.)  Leap  Frog: 

y(x,t+k)  = y(x,t-k)  + k(2ADQ+B)y (x, t)  + kf(x,t); 

(C-N)  Crank-Nicolson: 

(I  - y ADQ)y(x,t+k)  = (I  + ADQ+kB)y(x,t)  + kf(x,t); 
for  X = h, 2h, . . . , (N-l)h. 

Note  that  although  L.F.  is  a multi-step  scheme,  it  can  be  written  as  a single 
step  scheme  (3.1)  by  doing  as  in  [3,  p.  62].  All  the  above  three  schemes  are 
accurate  of  order  (2.2)  [3,  pp . 54,62].  In  addition,  the  L-W  scheme  is  dissipa- 
tive of  order  4[3,  p.  62].  The  L.  F.  and  C-N  schemes  are  non-dissipative,  but 
they  can  be  made  dissipative  by  adding  some  small  "higher  order"  terms. 

The  stability  theory  of  difference  schemes  for  purely  initial  value  problems 
is  very  well  understood.  The  question  of  stability  can  usually  be  answered  by 
the  standard  Fourier  transform  technique.  For  initial-boundary  value  problems, 
however,  their  stability  theory  is  not  as  well  developed  as  its  counterpart  for 
purely  initial  value  problems.  The  difficulty  lies  in  the  fact  that  boundary 
discretizations  also  affect  the  overall  stability  in  a very  im.portant  way  — a 
boundary  discretization  may  be  stable  for  one  scheme  but  may  not  be  stable  for 
another.  To  determine  the  stability  of  the  overall  finite  difference  scheme, 
one  must  analyze  the  eigenvalues  of  the  resolvent  operator  induced  by  the 
difference  schemes  as  in  [2].  Following  [2],  assume  that  f = 0 in  (1.1)  and  let 
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(3.3)  (hD_^)^v?Q(t)  = 0,  (hD_)^w^(t)  =0,  j = a natural  number, 

(3.4)  WQ(t+k)  = w^(t)  + kA^D_|_WQ(t)  , w^(t+k)  = w^(t)  + kA  D_w^^(t)  , 

(3.5)  WQ(t+k)  + w^(t+k)  - kA’*’D_^WQ(t+k)  = Wq(i)  + w^(t)  + kA'''D_|_WQ(t)  , 

w^(t+k)  + Wj^_^(t+k)-kA  D_Wj^(t+k)  = w“(t)+  Wj^_]^(t)  + kA”D_w”(t)  , 

(3.6)  ( = 0,  ( =0  , j = a natural  number, 

( = u^^^(t-k)-u^(t)  , A_u_^(t)  = u^_^(t-k)-u^(t))  . 

In  [2,  p.  671],  it  is  suggested  that  the  above  boundary  discretizations, 
along  with  (1.3),  when  combined  with  the  L-W,  L.F.  or  C-N  schemes  on  the  inter- 
ior nodes,  yields  stable  approximations  with  no  exponentially  growing  eigensolu- 
tions’’^*  in  the  following  cases: 

Boundary  discretization  Approximations  with  no  exponentially  growing 
eigensolutions  


(3.3) , j = 1 

L-W; 

C-N  for  N even 

(3.3),  j = 2 

C-N 

for  N even 

(3.3),  j = 3 

L-W; 

C-N  for  N even 

(3.4) 

L-W; 

L.F.  ; C-N 

(3.5) 

L-W 

for  N odd;  L.F.  , C-N 

(3.6) 

L.F. 

for  N even;  C-N  for  N even 

It  is  also  pointed  out  in  [2]  that,  e.g.,  for  L-W,  using  (1.3)  and  (3.3) 

with  j = 2 or  3,  the  finite  difference  scheme  is  stable  but  has  exponentially 

growing  eigensolutions.  However,  such  exponential  solutions  can  be  avoided  by 
adding  two  more  dissipative  boundary  conditions 

(l-h^D^)wQ(t)  =WQ(t),  (l-hV)w^(t)  =Wj^(t). 

A standard  consequence  here  is  that  stability  plus  consistency  yields  con- 
vergence (for  T < ■=°)  . This  is  mentioned  in  [2]  without  proof.  Meanwhile,  it 

: This  is  justified  for  a specific  (but  quite  standard)  example  only.  Its 

general  validity  remains  to  be  investigated. 

:*f  A 

: Those  eigensolutions  generate  large  errors  as  T “. 

: The  stability  eigenvalue  analysis  was  carried  out  by  computer  calculations 

(instead  of  a rigorous  proof) . 
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should  also  be  mentioned  that  although  the  accuracy  of  any  of  L-W,  L.F.  and  C-N 
the  interior  nodes  is  (2,2),  the  overall  accuracy  (i.e.,  taking  into  account  of 
the  boundary  discretization)  still  needs  careful  analysis.  Rates  of  convergence 
are  then  determined  by  the  accuracy  of  the  overall  finite  difference  scheme. 


§4.  Computational  Approaches  to  the  Quadratic  Regulator  Problem  (I):  Primal 

and  Dual  Methods  for  the  Unconstrained  Case 

Once  a finite  difference  scheme  is  chosen, it  is  a rather  straightforward 
matter  to  compute  the  optimal  control  u and  the  corresponding  state.  We  first 
replace  the  quadratic  cost  integral  (1.4)  by  a suitable  quadrature 

(4.1)  J^j^(w,u) 


where  < , and  < , are  inner  products  in  appropriate  Euclidena  spaces, 

and  {H. . ,H„}  are  weights  of  quadrature.  For  simplicity,  we  assume  that  r = 1 
ij  t 

and  p = 1 in  (3.1) . 


Z H.  . 

0<i<N 

Z- 

“k 


< w^(jk)  ,W^w^(jk) 


> ^ 


Osf: 


Osjr 


-1 

~k 


u(fk) ,N^u(^k)  >2» 


Since  it  is  a common  practice  in  the  method  of  finite  differences  to  choose 
equal  grids  in  the  space  variable  and  in  the  time  variable,  one  is  forced  to  do 
the  same  for  the  quadrature  in  (4.1).  This  has  the  disadvantage  of  decreasing 
the  accuracy  of  the  entire  problem  since  such  quadrature  accuracy  is  limited 
only  to  Cl(h+k)  . We  are  still  trying  to  Improve  the  accuracy  for  (4.1). 

We  now  use  (3.1)  and  (3.2)  to  represent  w as 


(4.2)  w^(t)  = Z u(jk)  + F (t),  v=0,l, 

t ^ 


,N“1,N;  t=p.k  for  p-=0, 1 , 2,  , . . , 


where  F^(t)  denote  the  sum  of  all  inhomogeneous  terms  depending  on  the  initial 
condition  yQ  and  the  inhomogeneous  term  f in  (1.1),  and  the  coefficients  . are 

determined  iteratively  from  the  related  coefficients  in  (3.1)  and  (3.2).  ^ 

For  explicit  schemes  (Q^sO)  such  as  L-W  and  L.F.,  the  iterative  computations  of 
a^j  can  be  directly  carried  out  on  a computer.  For  the  C-N  scheme  which  is 
implicit,  an  additional  matrix  inversion  (I+Q^) “^(I+Q^)  is  needed  at  each  time 
step  in  the  iterative  computations  of  . 


Now,  we  substitute  (4.2)  into  (4.1).  This  yields  a quadratic  expression 
J,,  depending  on  u(0) , u(k) , u(2k) , . . . ,u(T)  only: 

T/k  T/k 

(4.3)  J =-  Z < u(jk)  ,D.  ,u(^k)  + Z <Y.,u(jk)  > + 'C. 

.^0  j 

To  compute  the  optimal  discrete  control  u(0) ,u(k) , . . . ,u(T) , we  set 


on 
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0u(jk)  '^hk 

this  gives  a linear  matrix  equation 

= e 


u(jk) 


= 0,  j=0,l. 


’ k ’ 


\k\k 


hk 


^hk  ^ (u(0),u(k),...,u(T)), 


T T T 

where  M^  is  a full  (-^^  + 1)  x ^ 1)  stiffness  matrix,  and  6^  is  a (^  + 1) 

vector  depending  on  the  initial  condition  and  inhomogeneous  forcing  term  f 
only.  Thus 

hk 

We  state  the  following  theorem  without  proof. 

Theorem  4.1 


,0, 


Assume  that  the  scheme  (3.1)  and  (3.2)  be  convergent.  Let  f € C ([0,1] 

X [0,T])  and  let  y^  be  piecewise  continuous  on  [0,1].  Then  u^]^  as  obtained  in 
(4.3)  converges  to  u,  the  unique  optimal  control  of  (1.4),  (1.1),  (1.2),  in 
l2(0,T).  □ 


Next,  we  discuss  the  duality  approach:  we  consider  the  max -min  problem 


(4.5) 

where 

(4.6) 

and 

(4.7) 


max  min  [J^,  (w,u)  + 2 < p ( jk)  , CONSTRAINT  >]  , 

hk  rp 

P W,U  K^<i 

-L-J-k 

0SV£N 


CONSTRAINT  = 


(3.1.1)  if  (x  ,t)  = (vh,jk)  is  an  interior  node, 

(3.2.1)  or  (3.2.2)  if  (x^» t)  = (vh,jk)  is  a boundary  node. 


-15VSN+1,  ISjS^, 


P^(jk)  = p(vh,jk) 

are  the  discrete  dual  variables  defined  at  each  nodal  point. 

Problem  (4.5)  is  equivalent  to  the  primal  problem  (4.1),  (3.1)  and  (3.2). 

In  (4.5),  for  each  given  {p^(jk)},  we  first  minimize  with  respect  to  w^  and 
u(jk) , obtaining  minimizing  solutions 

(4.8)  w. 

(4.9) 


3 .P 


:J;(p) 

, 0 S V < N, 

2 . . 

T 

3 

. 0 S 
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where  in  the  above 
dual  variable  p. 
unless;  the  matrix 


r-,1  ^2 

, and  {S.}  are  affine  linear  operators  of  the  discrete 

The  relation  ^(4.9)  is  explicit , but  (4.8)  is  usually  implicit 
W in  (1.4)  is  positive  definite. 


Substituting  (4.8)  and  (4.9)  into  (4.5)  for  all  the  w,u,  we  obtain  a new 
quadratic  functional  Ihi^(p)  of  the  dual  variable.  We  maximize  Ij^j^(p)  with  re- 
spect to  p and  obtain  which  is  unique  provided  that  W is  positive  definite. 
From  (4.9)  and  (4.8),  we  obtain  the  discrete  optimal  control 


(4.10)  u(jk)  = S^(p) 

and  the  corresponding  discrete  state 


(4.11) 


/S 

W 


V 


§5.  Computational  Approaches  to  the  Quadratic  Regulator  Problem  (II): 

Quadratic  Programming  Methods  for  the  Constrained  Case 

Suppose  that  the  class  of  admissible  controls  is  constrained  by  linear 
Inequalities 

r Ku(t)  S g(t) , 

(5.1)  J 0 £ t s T, 

I u(t)  > 0, 

where  K is  a constant  s x m matrix  and  g is  a continuous  s-vector  valued  function 
on  [0,T] ; (5.1)  is  meant  in  the  sense  of  componentwise  inequalities.  We  assume 
that  there  exists  at  least  one  control  u(t)  satisfying  (5.1). 

After  discretizations,  (5.1)  becomes 


(5.2) 


Ku(jk)  S g(jk) 
u(jk)  S 0 


Thus,  from  (4.3),  we  are  faced  with  a discrete  quadratic  programming  problem 

T/k  T/k 


(5.3) 


min  2 <u(jk),D.  .u(-2k)  > + 2 <Y.*u(jk)>  + ^ 

u ^ j,^=0  j=0  J 


u(jk)  subject  to  (5.2)  for  0 < j < 


k 


We  wish  to  use  the  simplex  method  of  P.  Wolfe  [6]  to  solve  (5.3).  Let  u(jk) 
(0<jsT/k)  be  the  unique  minimizing  solution  of  (5.3).  From  the  Kuhn-Tucker 
conditions,  there  exists  a (discrete)  dual  variable  A = iA(jk)|A(jk) 

0 S j £ T/k}  such  that  the  following  relations  hold  simultaneously: 
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(5.4) 

(5.5) 


(primal  feasiblity)  Ku(jk)  < g(jk) , u(jk)  > 0 

. Tr  - - 

(dual  feasibility)  Z D.„u(-&k)  + y.  + K A(ik)  i 0,  X(1k)  2;  0, 

— — I'O 


&=0 


Vj  : 0<js  ^ , 


(5.6)  (complementary  slackness)  X.  (jk)  [Ku(jk)-g(  jk)  ] , = 0,  Vi  : Osi-,£s, 

il  1 1 

T/k 


[ Z D.„u(^k)  +Y.  +K^’"x(jk)].  u.  (jk)  = 0, 

^=0  J ^2  ^2 


Vi^:  0si25m, 

where  the  subscripts  i^  and  i^  denote,  respectively,  the  i^-th  and  i2“th 
component  of  each  vector. 

Now  define  the  slack  variables 


(5.7)  ^(jk)  = g(jk)  - Ku(jk)  € ]R^ 

T /k 

(5.8)  N(jk)  = Z D,„u(^k)  + Y,  + X(jk)  6 Ir“ 

j=o  J 

T f 

for  all  j=0<j<^.  In  the  new  variables  €,  T),  the  complementary  slack- 
ness conditions  (5.6)  become 


(5.9) 

X (jk)C.  (jk)  = 0 
1 1 

0 < i^  < s. 

(5.10) 

jj.  (jk)u.  (jk)  = 0 
2 ^2 

0 S i2  S m. 

The  problem  now  becomes  one  of  finding  a feasible  solution  (u,X,C,r)) 
following  system: 

^Ku(jk)  + C(jk)  = g(jk) 

ZD.„u(^k)  + K'^’^X(jk)  - Ti(jk)  = 

u(jk)  £ 0 

(5.11) 

X(jk)  > 0 

\ «(jk)  £ 0 


Tl(jk)  > 0 

X.  (jk)§.  (jk)  = 0 
1 1 
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u,  (jk)ri  (jk)  = 0 

2 2 

Wj  = 0 S j ~ ~ , Vi^  : 0 S < s,  Vi^  : 0 < 1^  £ m, 

which  is  a linear  programming  phase  I problem. 

One  then  proceeds  to  solve  (5.11)  by  the  simplex  technique  with  the 
restricted-entry  rule. 


§6.  Example:  a Regulator  Problem  Governed  by  the  One-Dimensional  Wave  Equation 

Consider  a quadratic  regulator  problem  governed  by  the  one-dimensional  wave 
equation: 

T 1 2 2 T 

(6.1)  min  / / [-^^(x,t)  + ||-(x,t)  ]dxdt  + P/  u^(t)dt  , 

0 0 at  Q 


subject  to 


2 2 

/ ~ (x,t)  - ^ (x,t)  = 0 , 0£x<l,  0<t<T, 

at  ax 


(6.2) 


z(x,0)  = a(x) 


at 


(x,0)  = P(x),  0 £ X 5 1,  (initial  conditions) 


I z(0,t)  = 0,  — (l,t)  = u(t)  , 0 £ t < T,  (boundary  condition  and 

control) . 


The  quadratic  index  (6.1)  is  associated  with  the  vibration  energy  and  control 
cost,  which  are  to  be  minimized. 


Letting 


(6.3) 


1 . az  , az.  1 .az  az  . 


we  transform  (6.2)  into  a first  order  hyperbolic  equation 


(6.4) 


a 

at 


y^(x,t) 

72 (^>t) 


yj^(x,0) 

72(x,0) 


-1  0 ' 

a 

yj^(x 

,t) 

0 1 

ax' 

72  (x 

,t) 

"h  (x) 

± 

> 

0 

£ X 

< 1 

h2(x) 

2(0, t)  = 

0, 

^2 

(l,t) 

— "S 

, 0£x£l,  OstST, 


\T2 


^/2 
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T 1 1 

/ / [y?(x,t)  + y^(x,t) ]dxdt  + p / u^(t)dt. 

0 0 0 

The  new  form  of  the  problem  agrees  with  (1.1),  (1.3)  and  (1.4)  in  §1. 

Let  T = 1 and  h^(x)  = 0 in  (6.4)  and  (6.5).  Using  the  method  of  character- 
istics, we  find  that  the  optimal  control  is 

-V2(l-t) *h, (1-t) 

u(t)  = 

and  the  corresponding  state  is 
y (x,t)  = I 

'h^(x-t) , t < X, 

rO 

y„(x,t)  = I 

Vh^(2-x-t)  + \/2  u(t+x-l)  , 

for  (x,t)  6 (0,1)  X (0,1). 

We  have  computed  a set  of  examples  using  the  primal  method.  The  Lax- 
Wendroff  scheme  is  adopted,  with  boundary  discretization  (3.4).  The  number 
p is  set  to  1 In  (6.5)  . 


if 


t + x-1  < 0, 
t + x-1  > 0, 


P + 2(l-t) 


t > x. 


with  cost  index 
(6.5) 


In  Figures  1,  2 and  3,  hj^(x)  = sin  ttx  is  used;  the  compatibility  condition 
(2.5)  is  satisfied,  so  the  optimal  control  u is  C^([0,T]).  The  corresponding 
state  y is  C®([0,1]  x [0,1]). 


In  Figures  4,  5 and  6,  h (x)  = 1 (0Sx<0.5)  and  h^(x)  = -1  (0.5<xSl)  is  used; 
none  of  the  compatibility  conditions  (2.5)  and  (2.6)  are  satisfied.  The  optimal 
control  u and  state  y are  both  discontinuous,  but  are  piecewise  smooth. 


In  each  of  the  above  two  cases,  the  numerical  results  show  good  agreement 
with  the  exact  solutions. 


We  have  also  computed  two  examples  using  the  dual  method,  with  the  leap 
frog  scheme  and  boundary  discretization  (3.4);  h^(x)  = sin  (ttx)  and  h^(x)  = 

sin  — X are  used.  The  numerical  results  for  u are  shown  in  Figure  7 and  Figure 
8,  respectively.  At  this  stage,  numerical  results  are  still  not  very  satisfac- 
tory and  the  computer  program  is  currently  being  Improved. 

For  the  constrained  case  studied  in  (§5) , we  have  computed  two  examples 
with  the  same  Lax-Wendroff  scheme  and  boundary  discretization  (3.4)  as  in  the 
primal  method.  Figure  9 is  the  numerical  solution  for  h^(x)  = sin  tTx,  with 
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control  constrained  by  -0.3  £ u(t)  £ 0 (0£t£l) . Figure  10  is  the  numerical  so- 
lution for  h,|^(x)  = 1 (0£x<0.5)  and  -1  (0.5<x£l.0),  with  constraint  -0,3£u(t)<0. 
For  these  two  examples,  no  exact  solutions  are  known  which  can  be  used  for  com- 
parison. We  have  managed  to  test  the  optimality  of  these  numerical  solutions 
through  other  means  and  have  found  that  they  are  indeed  very  satisfactory. 

§7.  Concluding  Remarks 

Although  the  order  of  convergence  of  our  numerical  results  has  not  been 
tested,  we  believe  that  for  C^([0,T])  optimal  controls  and  C^([0,1]  x [0,T]) 
states,  the  rate  is  0(h+k) . As  indicated  in  §2,  the  control  and  the  state  in 
general  do  not  have  such  smoothness.  Thus,  the  rates  of  convergence  can  be  ex- 
pected to  be  quite  slow  for  hyperbolic  control  problems.  If  we  know  a priori 
where  the  locations  of  discontinuities  of  u and  y are,  then  using  high  accuracy 
difference  schemes  can  usually  increase  the  rate  of  convergence. 

In  using  the  dual  method,  the  preliminary  calculations  for  (4.8),  (4.9)  and 
the  subsequent  programming  work  are  much  larger  in  amount  than  the  total  work 
in  the  primal  method.  Consequently,  errors  tend  to  occur  more  often  and  are 
harder  to  correct,  and  computation  costs  are  much  higher.  It  seems  to  us  that 
the  primal  method  is  more  preferable. 

In  applications,  only  feedback  controls  are  used.  To  obtain  such  closed 
loop  controls,  one  must  solve  or  compute  the  (nonlinear  hyperbolic)  Riccati 
partial  differential  equation  first.  Unfortunately,  work  in  this  area  is  in- 
complete [1].  This  remains  challenging  research  work  for  control  theorists  in 
the  future. 


REFERENCES 


[1]  H.  L.  Koh,  Structure  of  Riccati  equation  solutions  in  optimal  boundary  con- 
trol of  hyperbolic  equations  with  quadratic  cost  functionals.  Math,  Res. 
Center  Technical  Summary  Rep.  #1642,  Univ.  of  Wisconsin,  Madison,  June  1976, 

[2]  H.  0.  Krelss,  B.  Gustafsson,  A.  Sundstrom,  Stability  theory  of  difference 
approximations  for  mixed  initial  boundary  value  problems  (II),  Math.  Comp. 

26  (1972),  649-686. 

[3]  H.  0.  Kreiss,  Numerical  Methods  for  Solving  Time-Dependent  Problems  for 
Partial  Differential  Equations,  Univ.  of  Montreal  Press,  Montreal,  Quebec, 
Canada,  1978. 

[4]  D.  L.  Russell,  Quadratic  performance  criteria  in  boundary  control  of  linear 
symmetric  hyperbolic  systems,  SIAM  J.  Control  11  (1973),  475-509. 

[5]  D.  L.  Russell,  Mathematics  of  Finite  Dimensional  Control  Systems,  Theory 
and  Design,  Marcel  Dekker,  New  York,  1979. 


489 


FINITE  DIFFERENCE  NUMERICAL  METHODS  FOR  BOUNDARY  CONTROL  PROBLEMS 

[6]  P.  Wolfe,  The  simplex  method  for  quadratic  programming,  Econometrica,  27 
(1959),  pp.  382-398. 


APPENDIX:  Preliminary  Computational  Results 


FINITE  DIFFERENCE  NUMERICAL  METHODS  FOR  BOUNDARY  CONTROL  PROBLEMS 


492 


493 


■'*Tj  11::!;'' 


HFlilllfn 
IM  ! i 


finite  difference  numerical  methods  fon  boundary  control  problems 


FINITE  DIFFERENCE  NUMERICAL  METHODS  FOR  BOUNDARY  CONTROL  PROBLEMS 


This  Page  Intentionally  Left  Blank 


SESSION  VI:  DISCUSSION  - LSS  APPLICATIONS 
OF  DISTRIBUTED  SYSTEM  THEORY 

Moderator:  R.S.  Gran,  Grumman  Aerospace  Corporation 
Panel  Members:  J.S.  Gibson,  University  of  California,  Los  Angeles,  V.  Komkov, 
West  Virginia  University,  D.L.  Russell,  University  of  Wisconsin, 
and  D.C.  Washburn,  A.F.  Weapons  Lab 


SYNOPSIS* 

R.  S.  GRAN;  The  title  of  this  session  is  LSS  Applications  of  Distributed  System 
Theory,  and  we  are  to  discuss  what  the  role  of  this  theory  is  in  future  large 
space  structure  (LSS)  designs.  The  theme  for  the  discussion  is  set  by  address- 
ing the  question:  Is  the  LSS  finite  or  infinite?  That  question  has  been 

addressed  a lot  today  and  yesterday.  How  can  we  treat  it  as  an  infinite  system? 
If  we  treat  it  as  an  infinite  system,  can  we  say  something  that  we  cannot  say  if 
we  treat  it  as  a finite  system? 

D.  C.  WASHBURN:  The  Air  Force  Weapons  Laboratory  (AFWL)  interests  in  large 

space  structures  are  a little  different  than  those  of  NASA.  AFWL  is  interested 
in  high-energy  laser  applications  in  space.  The  systems  have  very  large  deform- 
able mirrors  that  are  controlled  with  perhaps  thousands  of  actuators.  The  laser 
looks  a lot  like  a rocket  engine  as  far  as  vibrations  go.  There  are  isolation 
mechanisms  which  kill  many  of  the  vibrations.  The  coolant  flow  is  probably  the 
single  largest  source  of  vibration  to  the  forward  structure  where  the  mirrors 
are  located.  Another  significant  source  of  vibration  to  the  structure  is  intro- 
duced through  controlling  the  mirrors  to  shape  the  wavefront.  The  control  per- 
formance requirements  for  the  optics  are  extremely  tight,  much  tighter  than 
anything  mentioned  yesterday. 

Everybody  seems  to  be  using  NASTRAN  modeling  to  define  the  models,  which 
are  then  truncated  for  designing  the  controller.  The  design  is  then  patched  up 
to  take  care  of  the  "spillover"  problem.  The  idea  of  this  conference  was  to 
bring  together  the  partial-differential-equations  (PDE)  and  finite-element  people 
to  discuss  other  approaches. 

Finite-element  and  PDE  models  are  both  approximate  and  it  is  not  necessary 
to  argue  over  whether  the  system  is  really  finite  or  infinite.  Clearly,  only  so 
many  modes  are  of  Interest  in  the  final  design.  It  seems  that  the  finite- 
element  approach  has  been  thoroughly  investigated.  The  point  I would  like  to 
make  is  that  there  should  be  a little  more  investigation  of  PDE  modeling.  Not 
because  It  is  going  to  be  better  or  worse,  but  because  it  has  not  been  investi- 
gated very  deeply.  It  may  give  us  some  new  insight  or  have  other  attributes  to 
offer . 


*This  synopsis  attempts  to  capture  the  main  points  discussed  but  has  not  been 
reviewed  or  endorsed  by  the  speakers. 
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Some  research  at  JPL  using  a PDE  model  to  design  an  infinite-dimensional 
controller,  which  was  then  truncated  for  implementation,  showed  that  it  auto- 
matically took  care  of  the  spillover  problem.  The  point  is  that  you  are  trun- 
cating at  a different  point  in  the  problem.  Truncating  in  the  finite-element 
approach,  in  which  you  throw  away  modes,  implies  throwing  away  a whole  differ- 
ent kind  of  information.  You  might  expect  different  results  with  PDE  and  pos- 
sibly more  robustness.  That  was  a simple  example  but  it  provided  an  initial 
indication  of  the  PDE  methods'  potential. 

The  bottom  line  is  that  it  seems  to  me  that  a certain  percentage  of  our 
effort  ought  to  go  into  investigating  all  avenues,  including  applications  of 
approximate  PDE  models  to  controller  design. 

G.  lANCULESCU,  JPL:  The  finite-element  method  has  some  advantages  over  the  PDE 

approach,  but  it  cannot  be  used  for  parametric  studies.  In  controller  design, 
we  really  do  not  want  to  solve  only  one  problem,  but  a class  of  problems  with 
various  applications  toward  various  designs.  That  is  not  easy  with  a finite- 
element  model  because  you  have  to  arrange  for  new  data  sets  and  new  models  for 
each  application.  The  PDE  method  can  result  in  analytical  solution  to  perform 
parametric  studies. 

A lot  of  very  useful  work  on  PDE  control  exists,  especially  regarding  neces- 
sary conditions  for  optimality,  existence  of  solutions,  and  uniqueness.  One  can 
use  these  techniques  on  the  PDE  solution,  then  apply  some  approximation  to  imple- 
ment the  control  law.  Mathematicians  know  so  much  more  about  PDE  theory  than 
has  been  used  thus  far  in  designing  large  space  structures.  More  research  is 
needed  to  explore  these  types  of  things. 

L.  S.  WEISSTEIN,  Lockheed  Missiles  and  Space  Co.:  Most  of  the  comments  made 

here  come  to  the  same  conclusion:  the  models  in  general  are  pretty  crummy. 

There  are  mathematical  engineering  works  that  have  addressed  modeling  of  lattice 
structures  with  the  complexities  of  large  space  platforms,  large  antenna  booms, 
and  parabolic  dishes.  The  basic  two  approaches  that  are  used  are  the  discrete- 
field  method  and  the  continuum  modeling  technique.  In  certain  classes  of  prob- 
lems, it  is  possible  to  obtain  analytical  closed-form  solutions. 

J.  N.  JUANG,  Martin  Marietta  Aerospace:  Our  experience  would  indicate  that  you 

can  achieve  about  the  same  accuracy  with  finite-element  and  PDE  models  and  the 
solution  should  be  about  the  same. 

V.  KOMKOV:  I am  concerned  about  how  badly  both  finite-element  and  continuum 

models  generally  represent  the  real  physical  systems.  It  seems  that  people 
really  pay  no  attention  to  the  physics  of  the  problem  and  check  all  the  assump- 
tions made. 

A second  concern  is  that  the  models  must  include  damping  because  the  effects 
can  be  very  serious. 

A third  criticism  is  that  millions  of  dollars  are  spent  on  experiments 
before  an  adequate  theoretical  basis  has  been  established.  Good  experimental 
data  can  tell  you  why  an  experiment  is  being  performed  and  what  is  expected  to 
be  proved  or  disproved  by  the  experiment. 
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It  is  surprising  that  so  little  attention  is  given  at  this  workshop  to 
adaptive  control.  It  is  an  important  subject  in  the  control  of  large  space 
structures. 

Finally,  we  should  ask  the  question:  Do  we  always  really  need  to  suppress 

vibrations  in  large  space  structures?  In  some  cases,  it  may  not  be  necessary  to 
suppress  vibrations  to  achieve  the  mission  objectives. 

B.  R.  HANKS,  Langley:  I generally  agree  with  the  comment  that  most  of  the  time 

the  errors  in  modeling  are  directly  related  to  the  failure  of  the  structure  to 
obey  the  assumptions  used.  Simple  structures  can  be  modeled  well  by  either  PDF 
or  finite-element  methods,  but  the  difficult  problem  comes  from  trying  to  model 
a real  vehicle  such  as  Voyager,  for  example,  PDF's  are  regularly  used  in  struc- 
tures to  do  parametric  studies,  and  I would  think  that  controls  people  would  do 
the  same.  When  you  get  ready  to  design  or  analyze  a complicated  system  that  is 
to  fly,  then  you  should  turn  to  the  finite-element  approach.  Nonlinearities, 
joints,  and  damping  present  the  real  difficulties  in  complicated  systems. 

D.  FLDRFD,  JPL:  It  is  disturbing  the  way  people  neglect  damping  in  large  space 

structure  models.  Damping  is  very  important,  and  I do  not  see  a lot  of  effort 
on  modeling  damping  effects  or  using  parameter  identification  techniques  to 
estimate  damping  coefficients. 

DISCUSSION:  A discussion  followed  among  the  audience,  Fldred,  Gran,  and  Gibson 

concerning  adaptive  control  and  identifying  damping  parameters.  A question  was 
raised  as  to  whether  damping  uncertainties  would  be  of  much  Importance  if  adap- 
tive control  was  used.  One  response  was  essentially  that  it  is  important  to  know 
quite  a bit  about  the  space  structure  characteristics,  including  damping,  in 
order  to  design  an  effective  adaptive  controller.  Part  of  the  reason  that  you 
do  not  hear  much  about  adaptive  control  is  that  it  is  highly  nonlinear  and  dif- 
ficult to  design. 

A specific  question  from  the  audience  was  addressed  to  Gibson  on  the  use  of 
the  semigroup  framework  in  parameter  estimation  and  adaptive  control.  Would  it 
be  possible  to  use  very  sophisticated  off-line  estimation  algorithms  based  on 
semigroup  theory  to  determine  changes  in  the  space  structure  characteristics, 
and  then  change  the  control  gains  in  a linear  optimal  control  law?  The  control 
would  still  be  linear  and  the  adaptation  would  be  in  the  form  of  a moving  window 
around  the  Riccati  equation.  Gibson  responded  that  he  thought  it  would  be  pos- 
sible and  that  he  is  beginning  to  study  parameter  estimation  in  the  semigroup 
context.  The  semigroup  theory  is  quite  useful  in  providing  convergence  results. 
If  you  look  at  the  adaptive  control  literature,  all  of  the  global  stability 
results  that  are  available  for  adaptive  control  schemes  generally  rely  on  know- 
ing the  structure,  and  the  dimension  of  the  system.  That  translates  into  having 
to  know  something  about  the  structure  of  the  semigroup  when  the  system  is 
infinite-dimensional.  Gibson  thought  that  the  semigroup  theory  will  be  useful 
certainly  in  off-line  parameter  estimation  and  expected  that  it  will  be  useful 
in  on-line  adaptive  parameter  estimation  and  control. 

D.  L.'  RUSSFLL:  There  are  several  reasons  by  PDF  models  are  needed.  First  of 

all,  PDF's  are  free  from  an  arbitrary  dimensionality  imposed  through  discretiza- 
tion. We  need  to  recognize  that  discretization  is  an  error  that  we  Impose, 
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externally,  into  our  model,  and  to  the  extent  that  we  can  avoid  in  analysis,  we 
certainly  should. 

PDE  models  are  also  free  from  the  qualitative  differences  that  arise  when 
we  try  to  decide  what  sort  of  elements  to  use,  whether  we  are  going  to  use  piece- 
wise  linear  elements  or  cubic  splines  or  what  have  you.  They  are  not,  of  course, 
free  from  all  sorts  of  qualitative  differences.  If  you  look  at  all  the  models 
that  are  available  for  modeling  beams  — the  Euler  beam,  the  Bernoulli  beam,  and 
the  Timoshenko  beam  — certainly  there  are  plenty  of  qualitative  differences 
among  these.  To  the  extent  possible,  we  should  not  compound  that  with  the  kinds 
of  differences  that  arise  through  the  discretization  procedures,  even  though,  in 
the  end,  you  are  going  to  have  to  discretize. 

The  third  point,  a very  serious  one,  is  that  the  control  properties  may  be 
radically  different  for  a distributed  system  from  those  that  are  exhibited  by 
any  finite-dimensional  system  of  ordinary  differential  equations.  Prime  exam- 
ples are  the  hyperbolic  systems  in  which  all  finite-dimensional  models  are,  in 
principle,  controllable  in  an  arbitrarily  short  length  of  time,  whereas  the 
infinite-dimensional  models,  at  least  for  point  control  as  distinct  from  dis- 
tributed control,  require  a certain  minimum  time  before  control  can  be  carried 
out.  If  you  go  to  the  parabolic  systems,  there  are  very  severe  limitations 
with  regard  to  the  smoothness  of  final  states  that  actually  can  be  achieved  with 
controls,  none  of  which  show  up  in  discrete  models.  Indeed,  I am  not  sure  — 
just  based  on  discrete  models — whether  the  fundamental  differences  between 
hyperbolic  and  parabolic  systems  could  be  formulated  and  perceived,  in  the  exact- 
ness that  we  have  them,  based  on  partial  differential  equations. 

The  fourth  point,  of  particular  relevance  here,  is  that  the  control  based 
on  finite-dimensional  approximation  may  take  advantage  of  what  I would  call 
"accidental"  relationships  between  coefficients.  Accidental,  in  the  sense  that 
they  are  a consequence  of  the  particular  discretization,  or  particular  dimension 
chosen,  or  whatever.  And  if  the  control  law  takes  advantage  of  these  more  or 
less  accidental  variations  and  coefficients,  we  may  very  well  arrive  at  a false 
conclusion  and  lack  of  robustness,  and  so  on,  simply  because  those  relationships 
will  not  hold  if  you  change  the  dimension  a little  bit  or  change  the  type  of  dis- 
cretization. So,  while  we  have  granted  that,  in  the  end,  the  finite-dimensional 

models  must  be  used,  nevertheless  the  control  law  Itself  should,  if  at  all  pos- 
sible, be  formulated  in  such  a way  that  it  is  relatively  free  from  the  effects 
of  discretization. 

The  fifth  point  is  that  partial  differential  equations  are  very  intimately 
related  to  conservation  laws  and,  in  most  cases,  the  parallel  conservation  laws 
should  be  taken  advantage  of  when  discrete  models  are  employed.  The  discrete 
models  should  be  as  close  as  possible  to  the  kind  of  things  we  would  do  if  we 
were  going  to  develop  a partial  differential  equation  for  the  same  system. 

There  is  another  Important  point  to  make  here:  in  early  20th-Century 

physics,  the  idea  of  a thought  experiment  was  very  important,  and  it  is  certainly 
still  important.  One  conceived  an  experiment  in  one's  mind,  which  was  not  actu- 
ally to  be  carried  out,  but  which  clarified  the  mental  processes  that  were  being 
applied  to  the  problem  at  hand.  A partial  differential  equation  model  can  be 
thought  of  as  a gedanken experiment  in  that  sense.  What  we  have  sitting  out 
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there  in  the  world  are  distributed  processes,  distributed  plants  for  which  no 
finite-dimensional  model  is  going  to  be  completely  accurate.  Now  the  point  with 
a partial  differential  equation  is  that  it  is  a relatively  manageable  conceptual 
model  that  has  some  of  the  same  properties.  Also,  it  is  such  that  no  finite- 
dimensional approximation  is  going  to  be  completely  accurate  in  describing  its 
behavior.  If  we  seriously  expect  to  control  these  very  complex  objects  in  the 
real  world  with  relatively  simple  control  strategies,  those  same  strategies  ought 
to  work,  mathematically,  in  handling  partial  differential  equations,  whose  rela- 
tionship to  finite-dimensional  systems  is  much  better  understood  than  is  the 
relationship  between  physically  distributed  processes  and  finite-dimensional 
systems.  So,  there  is  this  thought-experiment  side  to  partial  differential  equa- 
tions, which,  in  fact,  I think  is  one  of  the  stronger  points. 

The  seventh  point  is  the  relative  manipulative  simplicity  of  partial  differ- 
ential equations.  We  all  know  that  one  of  the  reasons  for  the  success  of  the 
calculus  is  that  it  is  a lot  easier  to  do  calculus  than  it  is  to  do  the  parallel 
discrete  mathematics.  Part  of  the  thinking  process  is  manipulative  and  being 
able  to  work  with  equations.  It  is  difficult  to  work  with  the  vast  mass  of  equa- 
tions that  result  from  finite-element  methods,  valuable  though  they  are  in 
numerical  procedures.  It  is  unimaginable  that  they  could  be  used  very  effectively 
in  conceptual  studies,  which  are  very  important.  Can  you  imagine  Maxwell's 
equations  being  developed  from  the  beginning  in  terms  of  three-dimensional  cubic 
spllnesj  or  something  of  that  sort?  There  are  very  Important  general  principles 
in  control  theory  of  distributed  systems  that  simply  are  not  going  to  emerge  or 
become  apparent  unless  continuous  models  are  used. 

A final  point  that  should  be  made  is  that  PDE's  are  not  sacred  in  any 
respect.  You  must  consider  what  you  need  based  on  the  physical  system.  For 
example,  to  obtain  structural  damping  in  a linear  oscillatory  system,  in  which 
the  rate  of  attenuation  is  proportional  to  the  magnitude  of  the  frequency,  one 
has  to  put  in  nonlocal  operators,  not  differential  operators,  in  order  to  real- 
ize that  type  of  behavior  of  the  spectrum.  Right  away  you  are  dealing  with  a 
class  of  objects  that  no  longer  is,  strictly  speaking,  in  the  set  of  partial 
differential  equations. 

L.  MEIROVITCH,  Virginia  Polytechnic  Institute  and  State  University  (VPI) : I 

fully  agree  with  Professor  Russell.  Many  of  the  problems  now  in  control  theory 
are  caused  by  trying  to  apply  finite-dimensional  control  theory  to  infinite- 
dimensional  systems.  People  like  to  talk  about  large  structures;  in  fact,  they 
quote  that  future  spacecraft  will  be  the  size  of  Manhattan,  and  by  that  they 
automatically  shift  attention  to  size  rather  than  order.  When  we  are  talking 
about  large  structures,  we  should  think  in  terms  of  large-order  systems,  not  in 
terms  of  physical  dimensions.  What  is  really  needed  is  a control  theory  which 
will  close  the  gap  between  the  finite-dimensional  modern  control  theory  and  the 
infinite-dimensional  systems  that  we  really  have. 

R.  S.  GRAN:  This  discussion  is  reminiscent  of  the  fight  not  too  long  ago  between 
classical  and  modern  control  engineers.  As  we  know,  some  of  the  recent  research 
on  robustness  has  helped  fill  the  gap  between  the  two.  Actually  the  gap  was 
more  perceived  than  real.  The  research  provided  the  understanding  to  remove  the 
perceptual  problems.  When  we  consider  the  large  space  structure  problem,  we 
should  really  work  on  filling  the  gap,  real  or  perceived,  between  finite-element 
and  continuum-model  approaches. 
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J.  S.  GIBSON:  In  modeling  large  space  structures,  we  have  been  discussing 

primarily  two  methods:  finite-dimensional  discrete  models  and  PDE  models. 

.Those  are  not  the  only  possibilities.  It  is  not  true  that  if  you  want  to  use 
an  infinite-dimensional  model  that  it  must  be  PDE.  For  example,  if  you  want  a 
model  that  has  the  same  damping  ratio  for  every  mode  of  a beam  or  plate,  there 
are  no  PDE  models  that  can  provide  that.  However,  semigroup  theory  would  handle 
that  case.  Those  of  us  who  advocate  treating  the  problem  in  the  infinite- 
dimensional  context,  using  semigroup  theory,  are  not  necessarily  talking  about 
PDE's.  Semigroup  theory  includes  both  finite-dimensional  and  infinite- 
dimensional  situations. 

D.  C.  WASHBURN:  I don't  think  the  discussion  should  center  around  the  differ- 

ence between  PDE  and  finite-element  modeling.  We  should  be  looking  at  whether 
we  are  going  to  make  approximations  along  modal  lines  or  along  some  other  lines. 
Using  PDE  you  can  make  approximation  in  other  ways. 

R.  S.  GRAN:  One  class  of  engineers  that  are  not  really  represented  at  this 

workshop  are  those  on  the  firing  line  to  design  and  build  the  space  systems  that 
actually  fly.  Those  people  have  been  designing  systems  for  years,  very  success- 
fully, and  they  would  not  know  a PDE  if  they  tripped  over  it.  What  do  we  tell 
those  people  about  why  they  should  use  PDE  models?  (There  were  no  direct 
responses  to  this  question.) 

Comments  and  Questions  from  the  Audience 

R.  P.  IWENS,  TRW:  It  seems  the  consensus  of  the  panel  is  that  we  should  be 

looking  more  at  partial  differential  equations.  I would  like  to  ask  the  people 
who  advocate  the  use  of  partial  differential  equations,  how  do  we  model  the 
structure  flexibility  in  the  Voyager?  We  may  learn  a lot  from  the  theory  by 
studying  control  of  simple  structures  using  partial  differential  equations,  but 
the  question  still  remains:  if  we  look  at  real  structures,  how  are  we  going  to 
model  something  as  complex  as  the  Voyager  with  PDE's? 

V.  KOMKOV:  Well,  it  has  been  done  recently.  There's  a group  at  the  University 

of  Iowa  that  have  taken  some  extremely  complex,  very  large  structures  and  devel- 
oped a combined  finite-element  and  PDE  model.  Of  course,  it  is  a computer  algo- 
rithm in  which  they  worked  for  a long  time,  but  it  can  be  done. 

G.  RODRIGUEZ,  JPL:  There  is  no  alternative  to  the  partial  differential  equation 

approach,  in  that  the  finite-element  model  is  itself  based  on  partial  differ- 
ential equations.  To  obtain  a finite-element  model  we  partition  the  structure 
into  a number  of  segments,  subassemblies,  or  substructures,  and  within  each  sub- 
assembly  we  assume  a simple,  partial  differential  equation.  It  could  be  a beam, 
it  could  be  a plate,  membrane,  whatever.  But  nonetheless,  there  is  a partial 
differential  equation  model.  For  complicated  structures,  this  process  results 
in  a very  complicated  set  of  partial  differential  equations  coupled  by  inter- 
element or  interface  conditions.  We  have  been  very  successful  in  implementing 
approximate  solutions  to  these  complicated  equations  to  the  extent  that  finite- 
element  techniques,  which  involve  approximating  the  solutions  by  interpolating 
polynomials,  have  been  very  successfully  developed  and  implemented  in  computer 
programs  such  as  NASTRAN.  So  the  mechanism  to  obtain  the  eigenvalues  and  eigen- 
functions of  that  complicated  partial  differential  equation  model  has  been 
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automated,  and  that  is  what  was  used  in  the  Voyager  spacecraft  and  is  used  in 
many  other  spacecraft.  There  is  the  possibility  of  going  beyond  the  computation 
of  eigenvalues  and  eigenfunctions  and  determine  control  and  estimation  gains 
directly  by  means  of  the  finite  element  method.  In  other  words,  we  would  start 
with  the  complicated  partial  differential  equation  model,  which  we  do  not  even 
have  to  write,  just  as  we  do  not  have  to  write  the  model  for  NASTRAN.  We  just 
assume  that  it  exists,  and  proceed  to  use  the  approximating  polynomials,  to 
solve  not  only  for  the  eigenvalues,  but  to  solve  also  for  the  control  gains  and 
the  feedback  matrices.  So  I think  that  the  issue  is  not  really  whether  there 
is  an  option  to  the  partial  differential  equation  — there  really  is  not.  The 
question  is  whether  we  can  develop  numerical  algorithms  or  numerical  techniques, 
approximation  techniques,  to  compute  directly  more  than  the  eigenvalues  and  the 
eigenfunctions  of  a complicated  structure. 

L.  MEIROVITCH,  VPI:  There  is  a method  that  is  called  Component-Mode,  which  was 

developed  by  Professor  Walter  G.  Hurty  from  UCLA,  that  can  be  regarded  as 
envisioning  the  spacecraft  as  a collection  of  PDE's,  each  member  being  repre- 
sented by  a set  of  so-called  modes,  which  could  be  obtained  by  solving  a partial 
differential  equation  for  each  of  the  members. 

A.  N.  MEYSTEL,  University  of  Florida:  I would  prefer  to  have  as  complex  a model 
as  possible.  There  is  no  rush  during  design,  and  it  is  possible  to  have  a very 
complex  model  of  the  system  being  designed  with  all  the  partial  differential 
equations  necessary  to  describe  the  system.  Then  the  control  person  can  try 
this  model  and  find  out  which  simplified  models  can  be  built  for  control. 

B.  R.  HANKS,  Langley:  I want  to  reiterate  in  a slightly  different  way  what  I 

said  earlier.  I think  the  argument  between  finite-element  models  and  partial 
differential  equation  models  is  academic  because  the  real  errors  come  about  due 
to  the  structure  not  obeying  any  sort  of  equation. 

For  example,  on  Voyager,  the  problems  were  that  the  joints  have  motion  and 
that  pieces  on  the  arm  move  about,  rattle,  and  cause  all  sorts  of  problems  that 
are  not  modeled  in  anybody's  model.  The  error  in  truncating  one  model  versus 
truncating  another  model  is  not  going  to  be  anything  like  what  it  is  in  the  real 
structure.  So  use  whichever  one  is  necessary  and  actually  design  something,  but 
you  can  not  guarantee  stability  in  any  case  until  you  put  it  in  hardware  and  fly. 
Take  something  like  the  Voyager  and  extend  it  to  30  times  its  size.  The  little 
arms  out  there  that  did  not  have  to  be  controlled  in  the  Voyager  case  will  have 
to  be  controlled.  So  we  do  have  that  problem  to  work  with,  and  component-mode 
synthesis,  everything  we  have  available  to  use  will  not  solve  the  problem  by 
itself . 
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ABSTRACT 


We  discuss  theoretical  and  computational  results  for  spline  based  ap- 
proximation schemes  used  in  parameter  estimation  algorithms  for  distributed 
systems.  Specific  applications  include  beam-like  structures  described  by  the 
Euler-Bernoulli  and  Timoshenko  theories  and  antenna  surfaces  such  as  that  in 
the  deployable  Maypole  Hoop/Column  model. 


INTRODUCTION 


With  the  use  of  composite  materials  in  large  space  structures  and  the 
exotic  shapes  and  configurations  for  antennae,  space  stations,  etc.,  the  need 
for  analysis  with  distributed  system  models  to  describe  complex  structures  in 
changing  environments  has  become  evident.  The  expected  fatigue,  degradation, 
and  changes  in  material  properties  due  to  ageing  and  environmental  stress 
increase  the  importance  of  parameter  and  state  estimation  techniques  for  such 
models.  We  report  here  on  our  investigations  of  methods  for  parameter  estima- 
tion. The  ideas  involve  spline  based  approximation  schemes  to  reduce  the 
distributed  system  problem  to  approximate  finite  dimensional  state  system 
problems  where  existing  algorithms  can  be  employed.  Our  goals  have  been  to 
guarantee  convergence  of  our  methods  and  to  test  their  numerical  feasibility. 
As  we  shall  outline,  the  methods  can  be  successfully  used  with  both  static  and 
dynamic  systems  data. 


DYNAMIC  MODEL  PARAMETER  ESTIMATION 

We  present  a brief  summary  of  our  joint  efforts  with  J.M.  Crowley 
(U.S.  Air  Force  Academy)  reported  in  more  detail  in  [1],  [2],  [3].  We  consider 
the  following  problem:  In  a dynamical  model  (e.g.  Euler-Bernoulli  or  Timoshen- 

ko theories)  for  elastic  structures,  estimate  parameters  (such  as  flexural 
rigidity,  shear  rigidity,  structural  damping,  loading,  etc.)  in  the  model  from 
observations  of  the  system. 
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Theory 


For  convergence  results,  we  have  developed  a semigroup  approximation 
framework  for  these  higher  order  models  that  is  related  to  that  given  for 
second  order  distributed  systems  in  [4] , [5] . A rather  concise  series  of 
steps  can  be  used  to  describe  these  efforts. 

(i)  We  write  the  system  (partial  differential  equation  with  boundary 
and  initial  data)  to  be  Investigated  as  an  abstract  evolution 
equation 

(5)  z(t)  = A(q)z(t) 


in  an  appropriately  chosen  Hilbert  space.  Here  the  operator  A 
(and  possibly  the  initial  data  Zq)  depend  on  the  vector  of 
parameters  q to  be  estimated,  for  example,  by  a least-squares 
fit  to  the  observation  data. 

(ii)  Wg  choosy  agproximatlon^subspaces  to  Z and  operators 

A = P AP  , where  P is  the  orthogonal  projection  of  Z onto 
Z^ . This  gives  rise  to  an  approximating  system 

z^(t)  = A^(q)z^(t) 

z^(0)  - P^Zq 

in  a finite  dimensional  space  z'^.  (We  have  found  it  very 
profitable  to  use  linear  spans  of  spline  basis  elements  - linear, 
cubic,  quintlc  - for  these  subspaces.)  An  associated  sequence 
(N  = 1,2,...)  of  estimation  problems  for  solved, 

yielding  approximate  parameter  estimates 

(ill)  A convergence  theory  for  q^  q,q  a solution  of  the  estimation 

problem  for  (S) , is  obtained  by  employing  general  linear  semigroup 
approximation  results  (the  Trotter-Kato  theorem)  along  with  funda- 
mental estimates  on  how  Z^  and  A approximate  Z and  A 
respectively  (i.e.  how  (%)  approximates  (S)).  (In  our  efforts 
these  estimates  are  obtained  from  basic  approximation  theorems  in 
spline  analysis  - e.g.  [6], [7].) 

We  note  that  while  most  of  our  efforts  in  problems  for  elastic  struc- 
tures have  dealt  with  estimation  of  constant  parameters,  our  theoretical 
ideas  (as  well  as  the  associated  computational  packages)  are  readily  extend- 
ed to  treat  problems  with  spatially  varying  coefficients.  Our  initial 
computational  findings  for  these  more  difficult  problems  indicate  that  the 
resulting  algorithms  are  very  efficient. 
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Implementation 


In  practice  we  have  used  the  spline  approximation  schemes  with  a 
standard  package  (IMSL-ZXSSQ)  for  the  Leyenberg-Marquardt  finite  dimensional 
optimization  technique.  The  resulting  algorithms  have  proved  (as  predicted 
by  the  theory)  quite  efficient  in  our  program  of  extensive  numerical  testing. 

We  have  developed  and  tested  algorithms  based  on  qulntic,  cubic,  and  linear 
spline  generated  subspaces  (the  choices  we  used  in  each  example  depending  to 
some  extent  on  the  particular  system,  the  desired  accuracy,  and  the  amount 
of  computational  effort  we  were  willing  to  expend) . 

In  our  testing  of  the  algorithms  we  have  focused  on  dynamic  beam  models. 
Our  emphasis  here  has  been  largely  motivated  by  the  Interest  of  engineers  at 
NASA  and  elsewhere  in  the  analysis  of  large  complex  structures  through  the 
use  of  equivalent  simple  continuum  models  (e.g.  see  [8] , [9 ] , [10] ) . 


Examples 

We  have  developed  the  theory  and  carried  out  numerical  testing  for  a 
number  of  situations  including  the  following. 


(A)  Viscoelastic  models  (the  Euler-Bernoulli  theory) : Equations  such  as 
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with  various  types  of  boundary  conditions  (combinations  of  fixed, 
simple  and  free)  have  been  Investigated  and  parameters  such  as 
El  (flexural  rigidity) , ^ (structural  damping) , and 

tri'  m m 

(viscous  damping)  have  been  successfully  estimated.  Schemes  with 
qulntic  and  cubic  spline  elements  were  employed. 


(B)  Models  with  shear  and  rotatory  inertia  (the  Timoshenko  theory) : 
Equations  studied  include  those  for  the  transverse  displacement 
y and  angle  ijj  of  cross  sectional  rotation 
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with  a = k'AG/m,  3 ==  aA/l,  y “ EA/m  (E,G,I,k',A  represent  the 
usual  Young's  modulus,  shear  modulus,  moment  of  inertia,  shear 
coefficient,  and  cross  sectional  area  respectively)  among  the 
parameters  estimated.  For  a fixed  end  beam,  cubic  elements  were 
employed  while  both  cubic  and  linear  element  schemes  were  tested 
for  the  cantilever  beam. 


STATIC  MODEL  PARAMETER  ESTIMATION 

Our  efforts  on  static  estimation  have  involved  at  various  stages  joint 
efforts  with  P.  Daniel  (Southern  Methodist  University),  E.  Armstrong  (NASA 
Langley) , and  R.  Teglas  (ICASE,  NASA  Langley) . In  this  case  the  semigroup 
formulation  for  the  theory  underlying  our  algorithms  is  not  needed.  Instead 
we  use  a weak  or  variational  equation  formulation 


(AQ(q)u-f,  v)  = 0 

of  the  equation  of  state  in  a Hilbert  space.  However  the  general  steps  (i)- 
(iii)  outlined  above  in  the  dynamic  case  are  again  followed.  In  this  case 
we  also  use  spline  subspaces  (linear  and  cubic  elements)  for  the  approxima- 
tions, combining  standard  spline  theory  estimates  with  variational  inequali- 
ties to  obtain  the  convergence  theory  of  (ill) . 

In  our  implementation  we  have  again  used  spline  schemes  with  the 
Levenberg-Marquardt  to  generate  and  test  our  algorithms.  We  are  presently 
still  testing  the  methods  on  examples,  but  our  initial  computational  findings 
are  very  promising. 


Examples 


We  have  developed  the  theory  and  are  testing  our  algorithms  on  a 
distributed  model  for  the  antenna  surface  in  the  deployable  Maypole  Hoop/ 
Column  configuration  under  development  by  the  Harris  Corp.  Our  investigations 
have  focused  on  the  variational  form  state  equation 

2tt  R^ 

{E  V u • V V - fvjrdrdO  = 0 

•'0 

where  u is  the  vertical  displacement  (from  hoop  level  u=0) , E = E(r,0)  is 
the  stiffness  (elastic)  coefficient,  and  f represents  the  applied  distri- 
buted load  (e.g.  through  the  control  stringers  and  catenary  cord  elements). 

In  a simplified  1-dimensional  test  example  (for  which  the  convergence  theory 
is  rather  easily  obtained)  where  we  assume  angular  symmetry,  we  have  employed 
with  success  both  cubic  and  linear  element  approximation  schemes.  Convergence 
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results  in  the  2-dlmenslonal  model  can  be  obtained  but  require  somewhat  more 
effort  than  in  the  1-dimensional  case.  We  are  currently  in  the  process  of 
numerically  testing  the  algorithms  for  this  more  general  model. 
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Abstract 


Two  methods  for  identifying  the  mass,  damping  and  stiffness  matrices  of  a 
linear  vibrating  system  are  presented.  Both  methods  require  the  measurement  of 
acceleration,  velocity  and  displacement  at  various  locations  of  the  system.  In 
the  first  method,  the  response  of  the  system  subjected  to  known  forces  is  used 
while  the  second  method  employs  the  free  vibration  data.  The  unknown 
parameters  are  recovered  through  the  standard  least  squares  procedure. 

Numerical  results  are  presented  for  several  examples. 

Introduction 


Many  proposed  space  missions  for  the  coming  decades  involve  large  space 
structures  (LSS).  Typical  applications  are  large  communication  antenna,  manned 
orbiting  stations,  astronomical  telescopes  and  solar  power  stations.  All  these 
structures  need  to  be  actively  controlled  to  realize  their  mission 
objectives.  A fundamental  problem,  which  poses  a significant  obstacle  to 
reali2'.ing  the  missions  for  LSS  is  that  the  mass,  damping  and  stiffness 
characteristics  will  not  be  accurately  known  apriori.  The  structures  are  made 
flexible  to  meet  the  weight  constraints  and  hence  fully  erected  or  deployed 
ground  vibration  tests  are  not  generally  possible.  The  use  of  composite 
materials  results  in  uncertain  time  variations  of  the  structural  parameters. 
Further  variations  result  from  vehicle  re-configuration,  thermal  environment 
and  fuel  consumption.  Active  control  of  LSS  necessitates  an  accurate  estimate 
of  the  parameters  so  that  control  laws  can  be  tuned  on-orbit  to  permit  less 
control  effort  to  be  expended. 

The  LSS  are  described  by  partial  differential  equations  and  are  associated 
with  infinite  number  of  modes  of  vibration.  Usually  a finite  element  method  is 
used  to  obtain  a finite  dimensional  model  for  LSS.  Accordingly,  two  approaches 
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to  the  identification  problem  can  be  found  in  the  literature.  One  approach 
(Ref.  1-5)  is  to  identify  the  discrete  or  lumped  parameter  model  of  the 
system.  The  other  approach  (Refs.  6-9)  is  to  identify  the  distributed  model 
i.e.,  the  partial  differential  equation  along  with  the  boundary  conditions.  In 
this  paper  we  adopt  the  first  approach.  Here,  we  first  formulate  a forced 
response  problem  to  identify  the  mass,  damping  and  stiffness  matrices.  The 
free  response  problem  based  on  this  formulation  degenerates  to  the  method 
discussed  in  Ref.  (1).  The  methods  proposed  in  this  paper  require  the 
measurements  of  acceleration,  velocity  and  displacement  at  various  locations  of 
the  system.  The  unknown  parameters  are  recovered  through  a standard  least 
squares  procedure^^.  Numerical  examples  to  illustrate  the  concepts  are  also 
presented. 


Forced  Vibration  Method 


Consider  a LSS  whose  dynamics  is  governed  by  the  following  linear  matrix 
differential  equation 


Mx  + Cx  + Kx  = F 


(1) 


where 

x:  nxl  configuration  vector  of  physical 

displacement 

M:  nxn  symmetric  positive  definite  mass 

matrix 

K:  nxn  symmetric,  positive  semi -definite 

stiffness  matrix 

C:  nxn  symmetric,  positive  semi -definite 

damping  matrix 

F:  nxl  force  vector 


Equation  (1)  can  be  rewritten  as 


T:  transpose 


(2) 


Now  we  consider  a measurement  process  wherein  the  position,  velocity, 
acceleration  and  the  forces  are  measured  at  discrete  instants  (tj,  t2> 
t|^) . Also,  we  consider  the  worst  case  in  which  the  apriori  M,  C and  K matrices 
are  felt  unreliable  and  we  seek  to  establish  best  estimates  of  all  elements  of 
M,  C and  K.  Upon  writing  m equations  identical  to  Eqn.  (2),  one  for  each 
measurement  time,  the  resulting  m matrix  equations  can  be  recollected  in  the 
following  form 
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AP  = U (3) 

where  A is  a mx3n  matrix  whose  row  contains  measurements  of  the  system 
response  at  time  tj. 

row  of  A = [x^(tj)  x^(tj.)]  (4) 

U is  a mxn  matrix  containing  the  forcing  functions 


row  of  U = [F^(tj)]  (5) 

P is  a 3nxn  matrix  containing  the  unknown  mass,  damping  and  stiffness 
parameters. 


P == 


Letting  { }.  denote  the  i^*^ 
written  as  ' 


(6) 


column  of  a rectangular  matrix,  Eq.  (3)  can  be 


A[P}^.  = {U}.  i = 1,2,...n  (7) 

Since  the  number  of  elements  in  {P}.  is  3n,  if  m > 3n,  then  Eqs.  (7)  over- 
deternrine  {P}j;.  The  only  requirement  is  that  the  A matrix  has  full  rank. 
Assume  that  tne  least  squares  solution  minimizes  the  residual  sum  square  error 

>.j  =2  (8) 

where 

= {U>i  - A{P}^  (9) 

Then  we  can  write 

= L[U}^  i = l,2,...n  (10) 

where  the  least  square  operator 

L = (aTa)-IaT  (11) 

Notice  that  L is  invariant  so  that  it  can  be  calculated  once  and  re-used  n 
times  to  calculate  the  n columns  of  P.  Obviously  the  method  is  straight 
forward.  However  it  requires  a large  number  of  sensors.  Also  the  number  of 
forces  must  be  equal  to  the  order  of  the  system. 

Now  consider  partitioning  the  configuration  space  so  that  equations  of 
motion  are 
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•• 


X-,  and  ^2  3r*e  (n^xl)  and  (n2Xl)  vectors  respectively.  Note  n^+n2  = n.  The 
degrees  of  freedom  in  x.  are  excited  directly  through  the  external  forces. 
x„  refers  to  the  degrees  of  freedom  that  are  excited  through  the  coupling 
terms.  The  question  is  whether  we  can  obtain  all  the  parameters  by  using 
forces  less  than  the  number  of  degrees  of  freedom.  Equation  (12)  can  be 
written  as  r-  -i 


rx'T  x^ 


kl  xj] 


One  can  easily  observe  that  x^  is  a linear  combination  of  x^  x^  x-j , x^ 

and  x^.  Hence  the  A matrix  is  rank  deficient  and  the  least  squares  process 
fails;  This  difficulty  can  be  overcome  if  we  assume  that  the  matrix 
known  apriori.  Then  the  A matrix  can  be  formed  such  that 

row  of  A = 


x|(tj)  x|(tj)  xJ(tj)  xJ(tj)] 
Also,  row  of  U matrix  = 

The  P matrix  is  given  as 


We  can  solve  for  P as  before.  The  matrices  M22,  C22  and  K22  are  obtained  in 
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where  M,  C and  K are  as  defined  earlier.  G is  the  gyroscopic  matrix  and  H is 
the  circulatory  matrix.  G and  H are  skew  symmetric.  Premultiplying  Eq.  (25) 
by  M“  ' 

X + (G  + C)x  + (K  + H)x  = 0 (26) 


Equation  (26)  can  be  written  as 


Ci"  xT] 


(G  + C)V^ 
(K  + H) 


= -X 


Thus  we  can  solve  for  M"^(G  + C)  and  M“](K  + H) . 
solved  from  a single  matrix  D given  as^^ 


(27) 


The  eigenvalue  problem  is 


M"^ (G  + C)  -M"^ (K  + H) 
I 0 


The  free  response  data  can  therefore  be  used  to  determine  the  eigenvalues  and 
eigenvectors  of  a general  dynamic  system 


Numerical  Examples 

Example  1 ; Figure  (1)  shows  a five  degree  of  freedom  mass-spring  damper 
systenu  Tbe”  system  matrices  are 


diac)[m^ 

m^  m 

3 ^^4 

11,5] 

k iH'k^ 

■k2 

0 

0 

0 

k2+ks 

CO 

1 

0 

0 

0 

k3+k4 

-l<4 

0 

0 

0 

-k4 

k4+kg 

-*^5 

0 

0 

0 

"'^5 

■^5 

Cy\-C2 

-C2 

0 

0 

0 

-^2 

C2+C3 

_C3 

0 

0 

0 

-^3 

C3+C4 

-C4 

0 

0 

0 

-^4 

C4+C5 

"^5 

0 

0 

0 

-Cg 

^5 
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the  following  manner.  The  estimated  Cio  and  Kio 
matrix  Eq.  (12)  to  form 


M22 

'Mi2‘ 

[B] 

C22 

= -[V] 

C12 

"^22 

Ki2 

B and  V are  constructed  from  ti 

measurements  as 


are  substituted  into  the 


(17) 


row  of  B = [xj(^j) 

row  of  V = Cx|(tj)  ^|(^^j)  (19) 

The  linear  system  of  Eqns,  (17)  can  be  solved  through  least  squares  to  get  M22> 
C22  and  K22*  Usually  the  mass  matrix  is  diagonal  and  hence  Mj2  is  zero. 


Free  Vibration  Method 


Consider  the  free  vibration  problem  for  an  undamped  system 


,Mx  + Kx  = 0 


Since  M is  positive  definite  exists 
X + M"^  Kx  = 0 


(20) 

(21) 


Equation  (21)  can  be  rewritten  as 

x^[M"^K]  = -x^  (22) 

Let  D = M“^K.  As  before  we  can  solve  for  D by  least  squares  method.  The 
A matrix  in  this  case  is  formed  such  that 

jth  of  A = x^(t  .)  (23) 

J 

Also,  row  of  U = - x^(t.)  (24) 

U 

Notice  that  all  modes  should  participate  in  the  free  response.  Otherwise  A 
matrix  becomes  rank  deficient.  Also,  note  that  M“1k  is  the  eigenvalue  matrix 
for  the  system.  Hence  the  system  eigenvalues  and  eigenvectors  can  be  obtained 
from  n.  Even  for  low  dimensioned  systems  failure  to  excite  a particular  mode 
obviously  means  that  the  corresponding  eigenvalues  and  eigenvectors  cannot  be 
obtained.  Broad-band  random  initial  excitation  at  several  stations  is 
recommended  in  Ref.  (2).  In  case  we  need  to  estimate  M and  K explicitly,  we 
should  have  apriori  knowledge  of  the  mass  matrix. 


Now  we  can  generalize  the  free  vibration  method.  Consider  a general 
dynamic  system  expressed  as^^ 


Mx(l^)  + (G+C)x(t)  + (K+H)x(t)  = 0 (25) 
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I 


F2,X2 


F4,X4 


The  following  sets  of  forcing  functions  are  used 


Set  1 

Set  2 

Fl 

cost 

t 

>^2 

cos2t 

t2 

F3 

cos3t 

cost 

cos4t 

cos2t 

cos5t 

cos3t 

For  simulation  we  assumed 


mi  = m^  = m^ 


100  kg 


ki  = = ks  = = kg  = 36  N/m 

Cl  = c^  = = Cg  = 1 .2  N‘ sec/m 

All  the  parameters  are  recovered  very  well  for  both  the  sets  of  excitation. 


Example  2;  A triangular  plane  truss  (Fig.  2)  is  used  as  a second 
example,  the masses  and  spring  constants  are  assumed  to  be  the  same.  The 
equation  of  motion  for  small  displacements  takes  the  form 


Mx  + Kx 


where 


x'''  = [x. 
F^  = [F. 
M = ml 


K = k/2 


^1 

^2  ^2 

^^3 

^3] 

9 

^2 

f^3 

f^4 

■ 2 

0 

-2 

0 

0 

0 

0 

1 

0 

0 

0 

-2 

-2 

0 

3 

-1 

-1 

1 

0 

0 

-1 

1 

1 

-1 

0 

0 

-1 

1 

1 

-1 

. 0 

-2 

1 

-1 

-1 

3 

After  considerable  effort,  the  following  forcing  functions  proved  to  be  useful 
to  recover  the  parameters. 
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= sint 


= cost 

F^  = sin  0.2t 

F^  = 1 + (sin3t)^ 

Fg  = cos5t 

Fc  = cos6t 
6 

Simple  harmonic  functions  such  as  sint,  sin2t, , . ,sin6t  are  not  useful. 

Example  3:  A four  mass  five  spring  planar  truss  as  shown  in  Fig.  3 is 

used  as  the  third  example.  Assuming  small  deflection,  the  mass  and  stiffness 
matrices  are  given  as 

M = ml 


20  -2  00000 
02000  -2  00 
-2  0 3 -1  -1  1 0 0 

0 0-131-10-2 

K = k/2  00-1  1 3-1-20 

0-2  1-1-1  3 0 0 

0000  -2  020 
_0  0 0 -2  0 0 0 2 

x'  = CXj  Vi  yj  Xj  yj  y^] 

pT  = [Fi  Fj  f3  f4  ^6  ^6  fy  fe] 

Simple  harmonic  functions  as  forces  are  not  suitable  to  recover  all  the 
parameters.  Hence  we  decided  to  use  square-wave  type  excitation  as  shown  in 
Fig.  4.  Eight  forcing  functions  of  different  period  'T'  are  used.  The 
amplitude  of  the  forces,  f is  made  equal  to  0.078  N.  Defining  w.  = 2ir/T.,  i = 
1,2, ...8,  we  set  w.  = (i  + l)/2.  All  the  parameters  (for  zero  measureme?it 
noise)  were  recovered  to  their  exact  values.  The  important  factor  appears  to 
be  the  type  of  excitation. 

The  free  vibration  method  worked  very  well  with  Example  1. 

Conclusions 

Two  methods  are  presented  to  identify  the  vibration  parameters  of  a linear 
system.  The  forced  vibration  method  recovers  all  the  parameters  of  the 
system.  Further  research  is  needed  to  determine  the  optimal  excitation  of  the 
structure  to  maximize  observability  and  computability  of  the  system  parameters. 
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JOINT  STATE  AND  PARAMETER  ESTIMATION 

N.  Carmichael,  University  of  Warwick,  Coventry,  U.K.  CV4  7AL 
M.D.  Quinn,  Sheffield  City  Polytechnic,  Sheffield,  U.K.  SI  IWB 


ABSTRACT 


Recent  results  are  described  concerning  the  problem  of 
state  reconstruction  for  a class  of  nonlinear  distributed  systems. 
The  methods  are  then  applied  to  the  problem  of  joint  state  and 
parameter  estimation.  An  example  with  the  wave  equation  is 
presented. 


1.  INTRODUCTION 


An  observed  semi-linear  system  of  evolution  equations  can  be 
described  by 


z = Az  + f(z),  z(0)  = Zq 

y = Cz 


1 


(1) 


where  z denotes  the  system  state  lying  in  an  appropriate  Banach 
space,  z^  is  the  initial  state,  A is  a linear  operator,  f is  a 

non-linear  operator,  y denotes  the  output  and  C is  a linear  output 
operator.  The  objective  is  to  consider  the  state  reconstruction 
problem  for  infinite  dimensional  systems  of  this  form.  There  is  a 
substantial  theory  already  available  for  state  reconstruction  in 
linear  systems  described  by 


z = Az 
y = Cz 


} 


(2) 


In  this  paper  we  use  the  results  from  linear  theory  together 
with  suitable  fixed-point  theorems  to  find  a solution  to  the  state 
reconstruction  problem  for  (1) . We  apply  these  results  to  the 
joint  state  and  parameter  estimation  problem  where  the  parameters 
are  assumed  to  be  constants,  and  the  system  has  nonlinearities  in 
both  state  and  parameter,  (see  [1],  [3]) 
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2.  STATE  RECONSTRUCTION 


Consider  the  linear  system  (2)  and  assume  that  A generates 
a strongly  continuous  semigroup  S(t).  Then  (2)  is  said  to  be 
initially  observable  if  the  map  from  the  initial  state  to  the 
output  trajectory 


H 


CSC')z 


0 


is  injective,  i.e.  Ker  H = {ol.  The  system  (2)  is  said  to  be 
continuously  initially  observable  if  Ker  H = {o}  and  Range  H 
is  closed.  This  latter  definition  means  that  we  are  able  to 
reconstruct  the  initial  state  given  the  output  trajectory 

y(*).  That  is  to  say  we  may  construct  a linear  bounded  inverse 
of  H,  denoted  which  has  as  its  domain  space  the  chosen 

space  of  output  trajectories  (this  space  is  usually  larger  than 
the  range  of  H) . Of  course,  as  elsewhere  in  partial  differential 
equations  much  depends  on  the  particular  function  spaces  used. 

We  proceed  formally  by  writing  down  the  mild  solution  of  system  (1) 


z(t)  = s(t)z  + { S(t-s)f (z(s))  ds 

(3) 

y(t)  = CS(t)z  +cf  S(t-s)f (z(s))  ds 
° '^0 

(4) 

Then  the  map  H is  defined  by 

H z = y(«)  “ C r S(  -s)f(z(s))  ds 
° '*0 

(5) 

therefore 

z = H'^CyC-)  - cf  S(‘ -s)f  (z(s))  ds) 

0 Jq 

(6) 

Now  substituting  (6)  into  (3)  we  obtain 

z(t)  = S(t)H~^(y(')  - C r S(* -s)f(z(s))ds)+  f S(t- 

'^0  '^0 

-s)f  (z(s))ds 
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(c)  B = {ze 


L’"(0,t 


rV 


^ a 


and 

(Rc^  + c^)  a ( I |z  I I,  I |z  I I)  $ K $ 1 
for  z,  ze  Ba  ; and  where  R is  defined  by 

llcl^  '=<--K)t(-C»)d2r||  s 


(■^  denotes  the  space  in  which  the  output  trajectories 
lie) 


then;  the  state  of  the  system  described  by  (1)  can  be 
reconstructed  given  an  observation,  y(«),  satisfying 

^ a 

C2I |h’1 

Proof 


m.r) 


ly(-)|| 


See  [1]. 


3.  JOINT  STATE  AND  PARAMETER  ESTIMATION 
Consider  a nonlinear  system  of  the  form 
z = g(z.“,  t)  , z(0)  = 

y = h(z,a,  t) 

where  z is  the  state,  y is  the  output  and  a is  a vector  of, 
a priori  unknown,  parameters.  Taking  a local  approximation  of 
these  equations  about  an  initial  guess  (z('),oi)  we  assume  that 
we  can  obtain  a system 

z = Az  + A^0(  + f(z,a)  I (9) 

y = Cz 

where  A,  A^^,  C are  time-independent  linear  operators. 
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The  right-hand  side  of  (7)  defines  a map  0 operating  on  the 
space  of  trajectories  z(*).  A fixed  point  of  this  map  $ 
will  be  consistent  both  with  the  state  equations  (3)  and 
the  output  equation  y = Cz  (since  of  z*  is  such  that 
z*  = $z*  then  Cz*  = COz*  = y) . Thus  we  have  reduced  the 
reconstruction  problem  to  finding  fixed  points  of  the  nonlinear 
map  In  general,  the  results  obtained  are  local,  both  in 
the  space  of  trajectories  and  in  time. 

There  are  several  fixed-point  theorems  available  for  use 
(see  [1],  [3])  but  here  we  state  a theorem  using  a local 
version  of  the  contraction  mapping  theorem. 

Theorem 


Suppose  that  A generates  a strongly  continuous  semigroup 
S(t)  and  that  the  linear  system  (2)  is  continuously  initially 
observable.  Let  Z,  Z^  be  Banach  spaces  and 

^ X 1.  1 

p,r,q,s,a,Cj^,C2,K,R  with  r^l,  p^q^l,-  = - + j-l,  s^l. 

Assume 

(a)  S(t)e^(Z^,Z)n/.(Z2,Z) 
with 

l|s(t)z||2  ^ » I Ig^l  lL^(0,t^)  ~ ^ ” 

||S(t)z||2  ^ g2(t)||z|l^^  , ll82MLP(o.tp  = C2  < CO 

(b)  for  some  a :r  X -r  which  is  continuous  and 

symmetric  with  a(6,0)  0 as  9 0 we  have 

f ; l’^(0,t^;Zj^)  L®(0,tj^;Z2)  satisfying 

Mf(z)  - f(z)|  $ a(||z|l  . MzM)||z  - z|i 

the  norms  on  the  right-hand  side  computed  in  L^(0,t^,Zj^)  . 
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In  practice,  however,  we  may  have  a time  varying  linear  part, 
a non-linear  part  in  the  output  equation  and  known  functions 
in  both  the  state  and  output  equations.  All  these  factors 
entail  modifications  to  the  detail,  but  not  necessarily  the 
essence,  of  the  present  approach. 

Assume  that  the  parameters  are  constant  and  finite-dimensional 

so  that  a = 0 and  oi£  Let  denote  the  augmented  state 

and  so  the  system  is  described  by 


The  joint  state  and  parameter  estimation  problem  is  to  reconstruct 
the  state  z(t)  (t  5 0)  and  the  parameters  a,  given  y(»)  on  [0,t^] . 

This  has  now  been  reduced  to  the  state  estimation  (for  the 
augmented  state)  case  of  Section  2.  To  apply  the  result  of 
Section  2 we  require  that  the  linear  part  of  (10),  (11)  be 
continuously  observable. 


Lemma 

Suppose  that  the  pair  (A,C)  is  continuously  initially  observable 
and  that  the  linear  part  of  (10),  (11)  describes  a well-defined 
dynamical  system.  If  the  map 


fA  A,1  . . . 

1 I IS  injective 

L C 0 J 

then  the  linear  part  of  (lO),  (11)  is  continuously  initially 
observable . 
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Proof 


Let  t = 0 then  Cz  =0.  Hence  Cz^  = Az  + A.  oi  = o 
0 0 0 1 0 

which,  by  the  injectivity  assumption,  is  only  possible 
when  ~ 0»  =0.  Therefore  the  map  defined  by  (13) 

is  injective  and  so  the  linear  part  of  the  augmented  system 
is  initially  observable.  Since  we  have  assumed  that  (A,C) 
is  continuously  initially  observable  and  the  parameter-space 
is  finite  dimensional  then,  by  (13),  the  range  of  F is  closed. 
Hence  the  linear  augmented  system  is  continuously  initially 
observable . ■ 

The  theorem  of  Section  2 can  now  be  applied  with  the 
additional  assumption  that 

^1  = {0}  . 

0 . 
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This  will  provide  (at  least  in  the  contraction  mapping  case) 
both  conditions  for  the  existence  of  a fixed  point  and  ah 
algorithm  (successive  approximation)  for  finding  it. 

Further  development  of  this  method  will  consider  the  case 
when  the  original  system  (A,C)  is  only  initially  observable  and 
also  the  case  when  the  parameter-space  is  not  finite 
dimensional  (e.g.  spatially  varying  parameters). 


4 . EXAMPLE 


Here  we  apply  the  results  of  the  preceding  sections  to  the 
observed  one-dimensional  wave  equation 


w = w + aw 

tt  XX 

(15) 

w(0,t)  = w(l,t)  = 0 

where  ae  IR  is  the  unknown  parameter,  and  the  observation 
is 

y(*)  = 1 c(x)  w(x,.)dx  = <c,w  > 

■^0 


Taking  initial  guesses  for  state  and  parameter  as  w,  a 
respectively  (w  does  not  depend  on  t,  w(0)  = w(l)  = 0 
and  5=0  for  simplicity)  we  then  make  the  usual  local 

approximation  w = w + w',  a = a + a'  to  obtain 


w'  = w*  + woi’  + w + a'w' 

tt  XX  XX 


(16) 


with  w'(0,t)  = w'(l,t)  = 0 
The  observation  is  now  given  by 


y(* ) 


<c,w’(.)>j^2(o^l)  + 


(17) 


Substituting  z 


1 

I , in  the  usual  way,  we  obtain  a 
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system  in  the  form 

2 = Az  + Aj^oi  + f(z,ot)  + u 
y = Cz  + V 

where  u,v  are  known  inputs 


A standard  result  shows  that  (A,C)  is  continuously 
initially  observable  if  y,  the  space  of  output  trajectories, 
consists  of  functions  of  the  form 


y(t)  = (a  cosn/^t  + b sin  nT^t) 
, n n 

n=l 


normed  by 


2 Jlf  2 ^ . 2n 
E n 7^  (a^,  + Pn  > 

n=l  c ^ 

n 


Now,  using  the  lemma  in  Section  3,  continuous  initial 
observability  for  the  linear  part  of  the  augmented  system 
obtained  if 


E 

n=l 


c w 
n n 


2 

n 


+ 0 


for  w = <w,<t>  >_  1,^  , 

n ’ n L (0,1) 


where  = /2  sin  nTCx.  Hence  we  may  apply  the  theorem  of 

Section  2 (suitably  modified  to  account  for  the  known  inputs 
u and  v)  to  jointly  identify  initial  state  and  parameters.  In 
the  present  example  we  have  a (9 ^,02)  ~ c(9^  + 62)  for  part  (b) 

of  the  theorem,  where  c is  some  constant.  The  principles 
illustrated  here  can  be  extended  to  create  joint  and  parameter 
estimators  for  other  hyperbolic-type  equations. 
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ABSTRACT 


Truss  beams  with  members  having  viscous  damping  are  modeled  witha 
Timoshenko  beam.  Procedures  for  deriving  the  equivalent  bending  rigidity, 
transverse  shear  rigidity,  and  damping  are  presented.  Explicit  expressions 
for  these  equivalent  beam  properties  are  obtained  for  a specific  truss  beam. 
The  beam  model  thus  established  is  then  used  to  investigate  the  effect  of 
damping  in  free  vibration.  Finally,  the  beam  is  employed  in  the  estimation 
of  structural  parameters  in  a simply-supported  truss  beam  using  a random 
search  algorithm. 


INTRODUCTION 


Many  methods  currently  employed  for  the  dynamic  analysis  of  structures 
such  as  the  finite  element  method  are  adequate  and  accurate.  However,  as 
the  system  becomes  large,  direct  application  of  the  finite  element  method 
is  found  to  be  computationally  expensive  and  its  accuracy  degenerates. 
Moreover,  as  each  individual  structural  component  is  taken  into  account  in 
the  formulation,  the  number  of  structural  parameters  is  large  which  makes 
the  system  identification  very  difficult. 

In  view  of  the  fact  that  many  large  space  structures,  although  large 
in  size  and  complex  in  detail,  often  appear  grossly  as  a beam,  plate,  or 
thin  shell.,  some  researchers  have  derived  equivalent  combinuum  models  for 
dynamic  analyses  [1-3].  These  simple  continuum  models  have  proven  to  be 
quite  accurate  in  predicting  natural  frequencies  and  mode  shapes  for  a 
significant  number  of  modes.  Since  these  equivalent  continuum  models  have 
few  structural  parameters,  they  are  particularly  useful  in  system 
identification  problems  [4]. 


* 
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structural  damping  plays  an  important  role  in  the  prediction  of  space- 
craft structural  responses  and  loads  [5]  which  are  needed  in  the  structural 
and  control  design.  As  the  aerodynamic  damping  is  absent  in  the  space 
environment,  damping  in  a spacecraft  results  from  the  deformation  in  each 
structural  member  and  joint.  The  contribution  to  the  overall  structural 
damping  from  each  member  with  known  damping  properties  can  be  theoretically 
estimated.  In  this  paper,  procedures  are  presented  for  deriving  the  gross 
damping  properties  for  the  simple  beam  model  which  is  used  to  represent 
beam-like  trusses.  The  beam  model  with  damping  thus  constructed  is  then 
used  to  study  the  effect  of  damping  in  free  vibration  of  a simply-supported 
truss  beam.  Finally,  this  model  is  employed  in  the  estimation  of  structural 
parameters  of  the  truss  beam  by  using  a random  search  algorithm. 


BEAM  MODEL  WITH  DAMPING 


Detailed  procedures  for  deriving  the  equivalent  beam  rigidities  from 
a turss  beam  were  given  by  p].  Among  the  three  beam  models,  shear  beam, 
Bernoulli-Euler  beam,  and  Timoshenko  beam,  the  Timoshenko  beam  theory  was 
found  to  be  most  accurate  and  most  suitable  to  represent  general  trusses. 
In  this  study,  the  Timoshenko  beam  model  is  used. 


For  simplicity,  consider  a beam  which  is  symmetric  with  respect  to  the 
mid-plane  so  that  the  extension  is  not  coupled  with  the  flexural  deforma- 
tion. The  shear  force  Q and  bending  moment  M are  related  to  the  transverse 
displacement  w and  the  rotation  i|)  as 


(1) 


wnere  i 
023  is 


iM  IS  ine  transverse  sne< 
the  bending-shear  coupT 


1 1 1 ^ Ill’ 


bending  rigidity,  and 


The  displacement-equations  of  motion  are  [1] 


El  - 6A(w,^  + i|<)  = pi 
GA  (w,^^  + = iii  w - q 

where  ^I^  is  rotatory  inertia,  ¥ is  mass  per  unit  length,  and  q is  transverse 
load  per  unit  length. 

The  bending  rigidity  El  and  shear  rigidity  M are  obtained  from  a 
typical  substructure  by  applying  a bending  moment  and  shear  force,  respec- 
tively, as  shown  in  Fig.  1.  They  are  expressed  as  [1] 
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Substructure  in  Bending 


Fig  1 Substructure  in  transverse  shear  and  bending  deformations. 


_ _ 1 
GA  ” 0 


(3) 


The  deformations  and  0,  can  be  calculated  analytically  or  numerically  using 
the  matrix  method.  From  the  solutions  for  the  substructure,  we  also  obtain 
the  uniaxial  strains  in  the  truss  members  as 


for  the  bending  deformation,  and 


for  the  transverse  shear  deformation.  In  Eqs.  (4-5),  the  rotation  gradient 
i|j,„  and  the  transverse  shear  strain  w,+4)  are  approximated  by 

X ^ 
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(6) 


and 


w,^  + = 0 


(7) 


respectively. 

stress-strain  relation  is  given 


(8) 

In  Eq.  (8),  a is  the  normal 
normal  strain,  d is  the  damping 
coefficient,  and  a subscript  i denotes  member  i. 

It  is  assumed  that  the  damping  is  small  so  that  the  damping  force  is 
much  smaller  than  the  elastic  force.  As  a result  of  this  assumption,  the 
elastic  relations  given  by  Eqs.  (4-5)  are  taken  also  for  nonvanishing 
strain  rates,  i.e.. 


e.  = a . i. 

(9) 

~ *^i  ^'^’x 

(10) 

Consider  truss  member  i for  which  the 
by 


in  which  the  viscous  damping  is  included, 
stress,  E is  the  Young's  modulus,  e is  the 


The  resulting  damping  forces  in  the  truss  members  computed  according  to 
Eqs.  (8-10)  are  then  added  to  the  elastic  forces.  Equation  (1)  then  takes 
the  new  form 


eI 


'23 


'33 


w,x+^ 

’i'.v 


(11) 


where  the  equivalent  global  damping  coefficient  C. . are  functions  of  the 
damping  coefficients  and  elastic  properties  of  the"^ truss  members. 


As  an  illustration,  consider  the  substructure  shown  in  Fig.  1 with 


and  that  all  the  members  have  the  same  elastic  constants.  The  simple 
bending  and  transverse  shear  deformations  yield  the  following  axial  strains 
in  the  truss  members  as 
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From  the  above  relations,  we  obtain  the  equivalent  elastic  transverse 
shear  force  and  bending  moment: 


for  which  we  have 


El  = i A E 


(16) 
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The  additional  transverse  force  and  bending  moment  due  to  damping  are 
calculated  using  Eqs.  (12-13).  We  obtain 


I damping 


2 

9 c A ^ 
, 2 ^ , 2v3/2 


lW,^4l 


(17) 


Thus,  the  equivalent  global  damping  coefficients  for  the  beam  model  are 

2 

2Lg 

^22 " ^3^3 


'33  " 1 ^g  ^^1 


(18) 


^23  ^32  ° 


Combining  Eqs.  (14)  and  (17),  we  obtain 


Q = GA  (w,^+i]j)  + C22  (w,j^+i) 

M = ET  ,K,^  + C33 

The  corresponding  displacement-equations  of  motion  are 

(El  - GA(w,^+,i,)  + (C33 

- C22(w,^+ii<)  = pi  <1^ 

[GA(w,^+ijj)J,^  + [C22(w,j^+'i')].j^  = mw  - q 


(19) 


(20) 


(21) 
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PARAMETER  ESTIMATION  - RANDOM  SEARCH  METHOD 


The  Timoshenko  beam  equations,  Eqs.  (20-21),  and  the  measurement  equa- 
tion can  be  expressed  in  the  form 


Z = Dz  + Kz  + Bu  , Xefi  , teT 
y = Cz 


(22) 


T T - -.1 

where  z = (z-j,  Z2)  = (w,  ijj)  is  the  distributed  state  vector,  u = (m  q,0) 

is  a p-vector  control  input,  teT(  = (t^,  t^))is  time,  x is  the  spatial 

coordinate  in  domain  fi,  D and  K are  linear  matrix  differential  operators 
in  given  by 


in  which 


D = 


K = 


640^ 

34D 

350 

6-|D^ 

3-|D 

-62D 

33D 

( 3-|  > 62  ’ 

33.  34 

, D = 8/3X 


(23) 


(24) 


(GA/m,  GA/pI,  EI/pI,  022/^*  C22/pI»  ^33/p^) 


is  a vector  of  all  the  parameters  in  the  simple  continuum  model,  and  B and 
C are  finite-dimensional  influence  matrix  functions  or  integral  operators. 

Of  practical  interest  are  the  following  boundary  conditions: 

(1)  Fixed- Fixed  Beam: 

z(0,  t)  = z(£,  t)  = 0 (25) 

(2)  Simply  Supported  Beam: 

rz(0,  t)  =rz(£,  t)  = 0 (26) 

where 
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r 


1 


0"^ 

D 


(3)  Cantilever  Beam: 

z(a,  t)  = rz(o,  t)  = 0 


where 


(27) 


(28) 


The  output  error  is  given  by 

e(t)  = y(t)  - y(t)  (y^-y^ V'^m^ 

where  y is  the  actual  measurement  vector. 

The  error  cost  function  is  defined  by 


J(3)  = 


T 

e 


Re  dt 


(30) 


in  which  R is  a positive  definite  weighting  matrix.  The  objective  of  up- 
dating the  simple  model  is  to  adjust  3 so  that  J(b)  achieves  the  minimum. 

The  estimation  procedure  of  the  random  search  method  has  been  success- 
fully employed  for  parameter  estimation  for  the  Timoshenko  beam  model  with- 
out damping  [4].  The  procedure  is  briefly  outlined  as  follows.  We  start 
with  an  initial  estimate  of  parameter  3 obtained  from  the  simple  model. 
Then,  an  iterative  process  follows  to  match  the  test  data  by  updating  the 
analytical  model.  Specifically,  at  the  ith  iteration,  the  parameter 
vector  3||  is  given  by 

3|i  A3j  3|^_1  - (B^j  ^2*  •••»  ^6^i-l  (^^^ 

where  the  increment  A3  is  given  by 

A3  - (i  oiiB-j  > i 0,2^2*  • • • » — (32) 

in  which  a.  (=N(a,  y))  is  a Gaussian  distributed  random  number  with  mean  a 
and  standard  deviation  y which  are  determined  by  the  accuracy  requirement 
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of  parameter  estimation.  For  example,  the  (1+0.25)%  error  limit  can  be 
provided  by  selecting  a = 0.01  and  y = 0.0025.  The  values  of  ai  (i  = 1-6) 
are  fixed  for  each  iteration.  The  constant  a^.  should  be  carefully  chosen 

as  this  scheme  may  diverge  if  a^.  are  too  large.  On  the  other  hand,  if  a^. 

are  too  small,  then  3 would  converge  to  true  values  very  slowly.  Note  that 
the  plus  and  minus  signs  in  A3  result  in  2°  trial  samples  of  3|^. . 

For  each  iteration,  the  following  steps  are  carried  out. 

Step  1:  Compute  Gaussian  distributed  random  numbers  ct- 

vJ 

step  2:  Discretize  the  continuum  model  (22)  with  given  parameters  3| • to 

obtain  a finite  dimensional  model  including  all  the  chosen  natural 
modes. 

Step  3:  Compute  cost  function  J(bI-)  (see  Eq.  (30))  by  integrating  the 

square  of  the  measurement  ^ error  between  the  analytical  model 
and  the  actual  system. 

Step  4:  Go  back  to  step  2 with  different  3 resulting  from  the  different 

plus  and  minus  combinations  in  Eq.  (32)  until  2^  trial  samples 
of  3|^  run  out. 

Step  5:  Obtain  the  direction  A3.-  of  the  smallest  J_(3|.)  in  the  group  of 

f\  ^ SI 

2 cost  function  0(3l.j).  If  05(3!^)  > 9°  to  step  8. 

Step  6;  Perform  a one-dimensional  minimization  of  J(b)  along  the  direction 
ABg  until  the  slop  of  the  cost  function  J(b)  changes  sign. 

Step  7:  Update  3L  at  which  cost  function  J(3)  is  minimum  and  return  to 

Step  1 . 

Step  8:  Minimization  is  completed. 


EXAMPLE 

Consider  a simply-supported  truss  beam  for  which  the  dimension  and 
properties  of  the  truss  members  are  given  in  Fig.  2.  In  this  figure,  A^, 

Ag,  and  A^  indicate  the  member  cross-sectional  areas,  E is  the  modulus  of 

elasticity.,  and  p is  the  mass  density.  The  damping  coefficients  will  be 
assumed  as  variables. 

The  equivalent  bending  and  transverse  shear  rigidities  for  the 
Timoshenko  beam  model  are  Computed  from  Eqs.  (15-16),  and  the  equivalent 
mass  m and  rotatory  inertia  ^ are  obtained  using  the  procedure  suggested 
by  [1].  We  have 
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7.5M  X 20  = ISOM 


E = 71.7  X 10®  [N/m=]  , p = 2768  [kg/m®] 

Ac  = 80  X 10-«  [m®]  , Ag  = 60  X 10‘®  [m®] 

Ad  = 40  X 10-6  [n,2] 


Fig.  2 Simply-supported  truss  beam. 

M = 1.468x10® 
rr  = 7.17x10^ 
iii  = 0.875 
pi  = 3.55 


N-m^ 

? 7 

N-sec  /m 


N-sec 


(33) 


Damped  Free  Vibration 

Free  vibration  (q=0)  of  the  simply-supported  Timoshenko  beam  can  be 
studied  using  Eqs.  (20-21)  or  Eq.  (22)  with  the  boundary  conditions, 

Eq.  (26).  The  frequency  equation  is  obtained  as 

+ h^  A^  + h£  A^  + h.jA  “ 0 (34) 

where 

ho  = (jd^  ; = (rtr/£)^  , r = 1,  2,  ... 

hi  = (3^6g  + 3334)^^ 

h2  = (3-j  + 33)0)^.  + $2  + 3435  coj, 

^3  " ^®4  ®6^“r  ^5 
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The  biquadratic  equation,  Eq.  (34),  can  be  solved  numerically  yielding  an 
infinite  number  of  solutions  A in  the  form 


(1) 


^ + i ; i = /^1 


(2)  (2)  . (2) 


(35) 

(36) 


where  and  are  positive  quantities, 


The  two  roots  A^^^  and  A^^^ 


(2) 


represent  two  different  branches  of  deformation  with  one  frequency  u' 

n 1 " 

much  larger  than  . The  branch  with  the  lower  frequency  is  of  practical 
interest  and  henceforth  will  be  taken  for  further  discussions. 


Corresponding  each  eigenvalue  A , we  obtain  the  mode  shape  of  deflec- 
tion as 

X t 

w = e sin  (rTrx/J.) 

> (37) 

A 

e cos  (rTTx/S-) 


where  the  amplitudes  W and  "i*  are  related  by 

==  W^[-A^  - (s^  + 64  A^)  + B^A^)  (rTT/A)  (38) 

For  the  following  three  sets  of  damping  coefficients. 


^22 

= 0 

N-sec  , 

C33  = 4x10^ 

2 

N-m  -sec 

^22 

= 200 

N-sec  , 

^33  " ^ 

2 

N-m  -sec 

^22 

= 100 

N-sec  , 

C33  = 4x10^ 

2 

N-m  -sec 

the  frequencies  of  the  lowest  branch  and  the  corresponding  damping  ratios 
are  presented  in  Table  1.  Only  the  modes  of  odd  numbers  are  listed. 

The  numerical  results  shown  in  Table  1 reveal  an  interesting  damping 
characteristic  in  Timoshenko  beam,  i.e.,  the  amount  of  damping  due  to  bend- 
ing increases  initially  as  the  frequency  increases,  but  decreases  after 
the  eleventh  mode.  On  the  other  hand,  the  amount  of  damping  due  to  the 
transverse  shear  deformation  is  larger  for  higher  modes.  For  the  lower 
modes,  the  transverse  shear  damping  is  negligible  as  the  bending  deforma- 
tion dominates  in  the  lower  mode  vibration.  When  both  bending  and  trans- 
verse shear  dampings  are  present,  the  former  is  dominant  in  the  lower 
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Table  1 

Natural 

Frequencies 

and  Damping  Ratios 

Damping 

0^3  = 4x10 

4 r = 0 

C33  = 0,  C 

22  "" 

C33  = 4x104 

, C22  =100 

^S^eq. 

Frequency 

Damping 

Frequency 

Damping 

Frequency 

Damping 

ModeNs^ 

(rad/sec) 

Ratio (%) 

(rad/ sec) 

Ratio(%) 

(rad/sec) 

Ratio(%) 

1 

3.691 

0.094 

3.691 

0.00051 

3.691 

0.094 

3 

30.66 

0.673 

30.66 

0.032 

30.66 

0.689 

5 

74.99 

1.303 

74.99 

0.169 

74.99 

1 .388 

7 

127.4 

1.699 

127.4 

0.422 

127.4 

1.910 

9 

182.8 

1.869 

182.6 

0.753 

182.7 

2.246 

11 

239.0 

1.891 

238.6 

1.126 

239.0 

2.458 

13 

295.0 

1.834 

294.4 

1.519 

295.0 

2.599 

15 

322.9 

1.789 

349.8 

1.918 

350.4 

2.707 

17 

406.0 

1.626 

404.7 

2.318 

406.0 

2.801 

19 

460.8 

1.512 

459.2 

2.716 

433.0 

2.845 

modes  and  the  latter  is  dominant  in  the  higher  modes  as  revealed  by  the 
results  for  the  third  set  of  damping  coefficients. 

Since  a truss  beam  has  been  shown  to  behave  like  a Timoshenko  beam 
[1],  its  motion  at  higher  modes  is  dominated  by  transverse  shear  deformations. 
Thus,  there  may  exist  a lower  bound  frequency  which  could  give  rise  to 
enough  damping  ratio  specified  by  the  control  designer.  Moreover,  the 
control  and  observation  spillover  may  be  overcome  by  the  material  damping 
and  no  complicated  control  compensations  are  needed.  In  that  case,  control 
designs  based  upon  discrete  models  without  residual  modes  may  be  justified. 


Parameter  Estimation 


In  the  undamped  or  purely  elastic  case,  the  truss  beam  (see  Fig,  2) 
can  be  analyzed  by  using  the  finite  element  method  to  model  each  individual 
truss  member  as  a rod  element.  The  solution  thus  obtained  can  be  regarded 
as  the  "exact  solution"  except  for  very  high  modes.  This  exact  solution 
can  be  used  as  a basis  for  estimating  the  accuracy  of  the  simple  continuum 
model.  For  the  simply-supported  beam  shown  in  Fig.  2,  the  first  five 
undamped  natural  frequencies  obtained  from  the  finite  element  program  SAPIV 
and  from  the  Timoshenko  beam  model  are  presented  in  Table  2.  The  corres- 
ponding modal  displacements  at  the  mid-span  are  also  given  in  this  table. 

It  is  evident  that  the  simple  model  yields  excellent  frequency  and  displace- 
ment predictions.  It  is  reasonable  to  say  that  the  simple  model  provides 
a good  initial  estimate  of  the  structural  properties  of  the  actual  system. 
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Table  2 Actual  and  Updated  Simple  Model 
Solutions  for  the  Truss  Beam 


Finite  Element 

Simple  Model 

Updated  Simple  Model 

Mode 

Frequency 

(v’ad/sec) 

Modal 
Di spl . 

Frequency 

(rad/sec) 

Modal 
Displ . 

Frequency 

(rad/sec) 

Modal 
Displ . 

1 

3.684 

0.124 

3.564 

0.121 

3.692 

0.121 

2 

14.32 

0.0 

13.84 

0.0 

14.32 

0.0 

3 

30.84 

-0.123 

29.75 

-0.121 

30.75 

-0.121 

4 

51.87 

0.0 

49.95 

0.0 

51.69 

0.0 

5 

76.30 

0.123 

73.20 

0.121 

76.08 

0.121 

When  damping  is  present,  we  will  still  take  the  finite  element  solu 
tion  as  the  actual  solution  and  the  simple  model  (with  damping)  will  be 
updated  using  the  random  search  algorithm  described  in  the  previous  sec- 
tion. A single  noise-free  displacement  sensor  and  a force  actuator  are 
chosen  and  co-located  at  the  center  of  the  span  for  input  and  output 
measurements. 

For  a unit  impulse  applied  at  the  raid-span,  the  response  of  displace 
ment  at  the  mid-span  can  be  written  in  the  form 

n .p  n.t 

A 

where  rij  + iwj  = i = is  the  transverse  modal  displacement  at 

the  mid-span  of  the  beam  corresponding  to  the  eigenvalue  A-,  and  n is 
the  total  mode  number  to  be  included  in  the  simulation.  ^ 

The  cost  function  J(g)  given  by  Eq.  (30)  with  R chosen  as  a unity 
matrix  is  obtained  by  exciting  both  the  actual  system  and  the  simple 
model  with  the  same  impulse  input.  For  the  purpose  of  simulation,  the 
full-scale  finite  element  modeling  of  the  actual  truss  beam  (without 
damping)  is  regarded  as  the  actual  system.  In  updating  the  simple  model 
only  the  bending  rigidity  n and  the  transverse  shear  rigidity  GA  are 
varied  while  the  damping  coefficients,  C22  ~ ^33  “ 4x10''*-,  are  kept 

constant.  By  using  the  random  search  algorithm,  the  bending  and  trans- 
verse rigidities  which  minimize  the  cost  function  are  found  to  be 

IT  = 7.7x10^  N-m^ 

1.499x10®  N 
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The  natural  frequencies  and  modal  displacements  for  the  first  five  modes 
calculated  using  these  updated  beam  rigidities  are  shown  in  the  last 
column  in  Table  2.  It  is  evident  that  these  frequencies  of  the  updated 
simple  model  are  all  within  0.3%  of  the  actual  solutions.  The  mode 
shapes  remain  unchanged  since  they  do  not  depend  on  the  bending  and  shear 
rigidities  which  are  represented  by  the  parameters  g-j , B2>  and 

It  is  noted  that  in  the  updating  process,  the  bending  rigidity  is 
modified  more  than  the  transverse  shear  rigidity.  However,  one  should 
expect  a greater  modification  on  the  transverse  shear  rigidity  when 
more  higher  modes  are  included. 


CONCLUDING  REMARKS 


In  this  paper,  a simple  Timoshenko  beam  model  is  developed  for  re- 
presenting truss  beams.  Viscous  damping  in  truss  members  is  included  in 
the  formulation.  This  model  makes  it  feasible  to  perform  the  control 
analysis  by  either  using  partial  differential  equations  or  systems  of 
ordinary  differential  equations  obtained  from  the  discretization  of  the 
simple  model. 

The  simple  model  thus  constructed  is  shown  to  yield  quite  accurate 
predictions  in  natural  frequencies  and  mode  shapes.  Moreover,  with  much 
fewer  parameters  than  the  finite  element  model,  it  also  provides 

an  efficient  way  for  performing  system  identification  when  actual 
measurements  on  the  system  are  available.  This  feature  of  the 
simple  model  is  particularly  attractive  if  in-flight  tests  are  to 
be  implemented  and  results  used  for  updating  the  structural  model. 

The  random  search  algorithms  used  in  this  study  is  efficient 
when  the  number  of  parameters  to  be  identified  is  small.  The 
Newton-Raphson  method  [6]  will  be  employed  and  compared  with  the 
random  search  method  in  a future  report. 
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ABSTRACT 


All  approach  is  advanced  for  the  concurrent  estimation  of  the  state  and 
of  the  model  errors  of  a system  described  by  elliptic  equations.  The 
estimates  are  obtained  by  a deterministic  least-squares  approach  that  seeks 
to  minimize  a quadratic  functional  of  the  model  errors,  or  equivalently,  to 
find  the  vector  of  smallest  norm  subject  to  linear  constraints  in  a 
suitably  defined  function  space.  The  minimum  norm  solution  can  be  obtained 
by  solving  either  a Fredholm  integral  equation  of  the  second  kind  for  the 
case  with  continuously  distributed  data  or  a related  matrix  equation  for 
the  problem  with  discretely  located  measurements.  Solution  of  either  one 
of  these  equations  is  obtained  in  a batch-processing  mode  in  which  all  of 
the  data  is  processed  simultaneously  or,  in  certain  restricted  geometries, 
in  a spatially  scanning  mode  in  which  the  data  is  processed  recursively. 
After  the  methods  for  computation  of  the  optimal  estimates  are  developed, 
an  analysis  of  the  second-order  statistics  of  the  estimates  and  of  the 
corresponding  estimation  error  is  conducted.  Based  on  this  analysis, 
explicit  expressions  for  the  mean-square  estimation  error  associated  with 
both  the  state  and  model  error  estimates  are  then  developed.  While  this 
paper  focuses  on  theoretical  developments,  applications  arising  in  the  area 
of  large  structure  static  shape  determination  are  contained  in  a closely 
related  paper  [1]  also  included  in  these  proceedings. 


I.  INTRODUCTION 


Obiectives 

The  main  objectives  of  the  paper  are:  1)  to  outline  an  approach  to 

estimate  the  state  of  a system  modeled  by  means  of  single  or  multiple 
elliptic  partial  differential  equations  or,  for  complex  structures,  by 
fine-resolution  piecewise-continuum  models,  and  2)  to  advance  a closely 
related  approach  to  estimate  the  errors  inherent  in  such  models. 

Mathematical  Statement  of  State  and  Model  Error  Estimation  Problems 

A general  statement  of  the  estimation  problems  is  provided  by  the 


following  equations: 

System  Model 

Au  = f. 

(1) 

Observations 

y = Hu  + n. 

(2) 

Boundary  Conditions 

Bu  = 0, 

(3) 
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where  u and  y are  respectively  the  state  and  observations,  and  A is  an 
elliptic  differential  operator  defined  over  the  spatial  domain  i2.  As 
outlined  in  Ref,  [IJ,  the  operator  A could  represent  symbolically  a large 
number  of  interconnected  partial  differential  equations  defined  over  a 
multiplicity  of  sub-domains  that  together  make  up  the  overall  structural 
domain  Q.  Such  fine-resolution  piecewise-continuum  models  would  be 
appropriate  for  complex  structures  that  cannot  be  modeled  completely  by  a 
single  partial  differential  equation.  The  boundary  conditions  Bu  = 0 are 
specified  at  the  boundary  of  the  domain,  where  B is  a differential  operator 
that  characterizes  the  geometric  and  natural  boundary  conditions  applicable 
to  the  system  model.  The  range  of  the  operator  H could  be  finite- 
dimensional, corresponding  to  the  case  of  discretely  located  observations. 

The  traditional  state  estimation  problem  consists  of  estimating  the 
state  u of  the  system  from  the  observations  y.  A related  estimation 
problem,  and  one  that  is  less  commonly  addressed,  is  that  of  estimating  the 
model  errors  f and  n,  which  reflect  the  combined  effect  of  a large  number 
of  uncertainties  and  error  sources.  In  many  cases,  these  two  terms  cannot 
be  fully  determined  in  advance  of  the  model's  application.  The  traditional 
'white-noise'  model  does  not  always  lead  to  useful  results.  Consequently, 
estimation  is  the  only  viable  means  to  establish  knowledge  of  the  model 
errors. 

Specific  cases  in  which  the  problem  of  model  error  estimation  occurs 
include:  determining  the  quasi-static  gravity  loads  in  a ground  antenna, 
estimating  internal  disturbances  (forces  and  moments)  due  to  attitude  and 
shape  control  actuators,  and  evaluating  the  lumped  effect  of  nonlinear 
terms,  governing  nonlinear  elastic  behavior,  which  have  been  neglected  by 
the  approximate  linear  model  in  (1). 

The  estimation  approach  advanced  here  provides  a solution  to  both  of 
the  foregoing  problems  simultaneously,  in  that  the  same  computations 
required  to  determine  the  state  estimates  can  be  used  (with  minor 
modifications)  to  determine  the  corresponding  model  error  estimates. 

Least-Squares  Approach  to  Estimation 

The  method  of  least-squares  seeks  to  minimize  the  following  quadratic 
functional  of  the  model  errors: 


J(f)  = / (n^  + f2)  dQ,  (4) 

J n 

subject  to  the  constraints  imposed  by  the  state  equations  (1)  and  (2).  The 
solution  of  this  problem  provides  an  optimal  pair  e = [f^.n^j],  consisting 
of  a process  error  estimate  f^  and  a corresponding  observation  error 
estimate  n^,  that  is  the  least-squares  estimate  of  the  actual  model  errors 
8 = [f,n].  The  state  estimate,  satisfying  Au  = f^,  also  arises  from  the 
1 ea s t-squar e s formulation.  In  Ref.  [2],  a similar  least-squares 
formulation  has  been  used  to  develop  a combined  state  and  model  error 
estimation  approach  for  time-dependent  causal  systems.  The  aim  of  this 
paper  is  to  extend  these  results  to  infinite-dimensional  elliptic  systems. 
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Elliptic  Models  for  Estimator  Design 


Central  to  the  shape  determination  schemes  under  development  here  is 
the  selection  of  a mathematical  model  to  represent  the  elastic  behavior  of 
the  structure.  Three  classes  of  models,  useful  at  three  distinct  stages  of 
the  design  process,  that  are  applicable  to  this  selection  are:  1)  single 
partial-differential-equation  models  defined  over  a single  domain  for 
development  of  initial  physical  insight  and  understanding,  2)  multiple  PDE 
models  defined  over  multiple  spatial  domains  for  more  in-depth  parametric 
studies  and  estimator  design,  and  3)  piecewise-continuum  models,  defined 
over  a possibly  fine  structural-element  grid  (exact  finite-elements),  for 
detailed  design  and  evaluation,  A more  complete  description  of  these 
models  with  typical  examples  are  provided  in  Ref.  [1], 

The  Observation  Model 


To  complete  a description  of  the  models  used  for  estimator  design,  it 
is  of  interest  to  make  a few  brief  comments  about  the  observation  equation 
(2).  This  equation  is  characterized  by  the  operator  H defining  the  state- 
to-observations  map.  By  appropriate  definition  of  this  operator,  it  is 
possible  to  encompass  within  the  observation  model,  a wide  range  of  data 
collection  strategies  of  practical  interest. 


TYPE  OF  MEASUREMENT 


STATE-TO-OBSERVATION  MAP 


(a)  Full  Domain 

(b)  Boundary 

(c)  Discrete 


Hu  = u(x) 
Hu  = u(x) 
Hu  = [uCxj) 


xeO 

xer(T)  (5) 

,u(xf^)] 


Fig.  1 Illustration  of  Sensing  Strategies 


Some  typical  distributed  sensing  strategies  are  illustrated  in  Fig.l. 
Out  of  the  three  types  of  measurement  strategies  shown  above,  perhaps  most 
commonly  encountered  in  practice  is  the  one  corresponding  to  discretely 
distributed  sensing.  Consequently,  the  estimation  methodology  advanced 
here  will  ultimately  be  implemented  for  this  case.  However,  investigation 
of  the  schemes  involving  sensing  either  at  the  boundary  or  within  the 
entire  spatial  domain  is  also  of  interest  at  times  (in  this  paper),  because 
such  an  investigation  can  lead  to  valuable  insight  and  understanding  about 
the  estimation  problem  and  its  corresponding  solution. 
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The  function  space  approach  to  estimation  advanced  below  encompasses 
within  the  same  general  framework  all  of  the  three  oases,  provided  that  an 
appropriate  interpretation  of  H is  used.  Other  schemes,  such  as  those 
based  on  spatial-slope  data  (as  opposed  to  the  deflection  measurements 
discussed  so  far)  can  also  be  investigated  within  the  same  formulation  or, 
at  most,  with  possible  minor  modifications. 

2.  OUTLINE  OF  FDNCTION  SPACE  APPROACH  TO  ESTIMATION 

One  of  the  major  objectives  of  the  paper  is  to  advance  a function 
space  approach  to  estimation  for  elliptic  systems  that  extends  results 
reported  previously  [2],  applicable  primarily  to  time-evolving  causal 
systems.  Because  of  its  length,  it  is  convenient  to  partition  the 
presentation  of  the  approach  into  the  following  major  pivotal  steps  (see 
illustration  in  Fig.  2): 


1.  Showing  that  the  least-squares  estimation  problem  is  equivalent  to 
that  of  finding  a vector  of  smallest  norm  subject  to  linear 
constraints  (Sec.  3). 

2.  Establishing  that  the  optimal  estimate  (of  smallest  norm)  can  be 
determined  in  terms  of  the  generalized  inverse  of  the  operator 
characterizing  the  linear  constraints  (Sec.  3). 

3.  Computing  the  generalized  inverse  above  from  the  solution  of  a 
Fredholm  integral  equation  of  the  second  kind,  for  the  case  with 
continuously  distributed  measurements,  and  from  the  solution  of  a 
closely  related  matrix  equation,  for  discretely  distributed  data  (Sec. 
3). 


4.  Specifying  a number  of  alternative  methods  (direct  matrix  inversion, 
eigenfunction  expansions,  etc.)  for  solving  either  the  integral 
equation  or  the  related  matrix  equation.  These  methods  can  be 
classified  into  the  two  broad  categories  of  batch  solutions  (Sec.  4), 
in  which  all  of  the  data  is  processed  simultaneously,  and 
scanning  algorithms  (Sec.  5),  in  which  the  data  is  processed 
recursively. 


5. 


Developing  a statistical  analysis  that  results  on  explicit  formulas 
for  the  second-order  statistics  of  the  important  variables  (state  and 
model-error  estimates,  estimation  error,  etc.)  in  the  problem 
formulation  (Secs.  6-7). 


GENERALIZED 

MINIMIZE  INVERSE 


<€,  €> 

he  = y 


€q*  h*(hh*r^y 


MUST  savE 


COVARIANCE 

KERNEL 


hh*-(l+HLL*H-) 


OqIx)  + /rlx/f ) riglEldE  ' ylx) 
n^  = (l+HLL*H*r^ 

Fig.  2 Illustration  of  Function  Space  Approach  to  Estimation 
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3. 


MINIMUM  NORM  PROBLEM.  GENERALIZED  INVERSE  AND  FREDHOLM  EQUATION 


To  develop  the  estimation  approach,  it  is  convenient  to  first 
introduce  notation  necessary  to  define  suitable  function  spaces  for  the 
estimation  problem.  Let  Hj,  H2  and  Hg  denote  respectively  the  function 
spaces  to  which  the  process  error  f,  the  state  u,  and  the  observations  y 
(and  the  observation  error  n)  belong.  For  example,  Hj=H2= 
appropriate  for  the  problem  with  continuously  distributed  data,  while 
and  Hg=R^  would  be  applicable  to  the  situation  with  discrete 
measurements.  The  model  error  e is  then  defined  to  be  the  pair  e = [f,n] 
belonging  to  the  Cartesian  product  space  [3]  x Hg.  This  space  is 
endowed  with  the  inner  product  <8j^,e2>  = * (n.j^>n2)3»  where 

Sf  = [fj,nj^]  and  82  = [f2»i^2^  arbitrary  vectors  in  the  space,  and 

(•,•),  is  the  inner  product  inH.(S2). 

X X 

Based  on  these  definitions,  the  optimization  problem  can  be  recast  as 
follows : 


Hg==L2($J)  would  be 


minimize 

^8,8^ , 

(6) 

subject  to 

he  = y. 

(7) 

where  h = [ HL  I] 

is  a linear  operator  mapping  x 

Hg  ^ H3  that 

characterizes  the  linear  constraints  under  which  the  optimization  problem 
must  be  solved.  This  operator  h is  defined  in  terms  of  the  integral 
operator  L specified  by 


Lf  = / "^(x/Uf(^)d4,  (8) 

J ^ 

where  is  the  Green's  function  for  A in  (1).  If  A is  symmetric,  as  is  the 
case  in  most  elastic  models  representing  practical  structures, 

^ (x/^)  = ^5(^/x)  is  also  symmetric,  and  the  corresponding  operator  L=L  is 
self-adjoint  with  respect  to  the  inner  product  (•5*)2* 

The  model  error  estimates  =[f„,n„]  are  the  solutions  of  this 

000 

minimum  norm  problem.  The  unique  element  of  smallest  norm  is  given  by  13]: 

8y  = h*(hh*)“^  y,  (9) 

where  h mapping  x H3  is  the  ^operator  adjoin^  to  h with  respect  to 

the  inner  produc  t <•,*>.  Since  hh  = [I  + HLL  H ],  the  model  error 
estimates  (and  the  corresponding  state  estimate)  can  be  expressed  as 
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(10) 


= L*H*  [I  + HLL*H*]“^  y. 

Up  = [I  + HLlV]"^  y,  (11) 

= LlV  [I  f HLlV]"1  y,  (12) 

where  the  state  estimate  specified  by  (12)  has  been  obtained  by 
substitution  of  (10)  into  (8). 

The  above  are  the  most  general  equations  to  be  presented  here  for 
evaluation  of  the  model  error  and  state  estimates.  The  key  computation 
required  by  these  equations  is  the  inversion  of  the  operator  [I  + HLL*H*]. 
Because  this  inversion  takes  place  in  the  Hilbert  space  H3 > the 
computations  required  to  conduct  the  inversion  depend  on  the  measurement 
scheme  under  investigation  (see  Fig.  1).  For  continuously  distributed 
measurements,  as  in  Figs.  1(a)  and  Kb),  the  inversion  of  [I  HLL  H ] can 
be  achieved  by  solving  a Fredholm  integral  equation  of  the  second  kind. 
With  discretely  located  data  as  in  Fig.  1(c),  [I  HLL  H ] is  a matrix 
whose  dimensions  are  equal  to  the  number  of  data  points. 

Inversion  of  [I  + HLL*H*]  with  Continuously  Distributed  Data 


This  case  corresponds  to  that  of  Fig.  1(a)  and  assumes  that  H^  in  (1) 
is  the  identity.  With  this  assumption,  a direct  computation  of  LL  using 
(8)  leads  to  the  following  Fredholm  integral  equation  of  the  second  kind: 


n (x)  + / r(x/§)  n (5)  d^  = y(x)  for  x in  Q, 

Jn 


(13) 


specified  in  terms  of  its  kernel  r(x/^)  given  by 


./ 

Ju 


r(x/^)  = / ^(x/n)  ®(ti/5)  dn. 

U 


(14) 


Because  the  kernel  is  the  one  key  element  characterizing  the  integral 
equation,  it  is  of  interest  to  explore  some  of  its  major  properties. 
First,  observe  that  r(x/4)  is  a Green's  function  for  A*A,  i.e.. 


A*Aj^  r(x/?)  = 8(x“4), 


(15) 


where  6 is  the  impulsive  'delta'  function,  and  the  subscript  x in  A^ 
implies  that  the  spatial  differentiations  embedded  in  A are  conducted  with 
respect  to  x (as  opposed  to  being  performed  with  respect  to  ^).  This 
result  can  be  obtained  readily  by  operating  on  (14)  with  A A and  using 
twice  the  condition  A 6,  which  follows  from  the  observation  that  ^ is  a 
Green's  function  for  A.  This  result  implies  that  r(x/^)  can  be  computed 
directly  using  (IS),  without  necessarily  solving  first  for  an  approximation 
to  ^ and  then  using  (14)  to  approximate  r.  Such  a direct  method  could 
potentially  be  more  accurate  as  discussed  further  in  Ref.  [1]. 
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Another  important  property  of  r(x/^)  is  its  interpretation  as  the 
covariance  E[u(x)u(^)]  of  the  state  u,  under  the  assumption  (used  only 
infrequently  in  this  paper)  that  the  actual  errors  6 = [f,n]  in  (1)  and  (2) 
are  spatially  distributed  white  noise.  This  property  will  be  investigated 
further  as  an  integral  part  of  the  statistical  analysis  provided  in  Sec.  7. 

The  immediate  aim  here,  however,  is  to  first  state  the  matrix  equation 
related  to  (13)  that  must  be  solved  for  the  case  with  discretely  located 
data. 

Inversion  of  [I  t (M/) (HL)  ] with  Discretely  Spaced  Data 


This  case  corresponds  to  Fig.  1(c)  where  the  data  is  collected  only  at 
a finite  number  of  spatial  locations  distributed  throughout  the  domain. 
The  matrix  equation  analogous  to  (13)  that  must  be  solved  for  this  case  is 

N 

“o  ^ ^ “ 1,...,N,  (16) 
m=l 

where  n^  denotes  the  value  of  the  observation  error  estimate  at  the 
location  Xj^.,  and  where  N denotes  the  number  of  spatial  measurements.  Note 
that  in  both  (13)  and  (16)  the  kernel  r(x/^)  plays  a key  role  in 
characterizing  the  equation  to  be  solved.  For  the  problem  with 
continuously  distributed  data  in  Fig.  3(a),  the  kernel  r(x/^)  is  defined 
over  the  square  region  [xj^leQ  x Q.  For  the  case  with  discrete  data  in 
Fig.  3(b),  the  kernel  r(x/§)  needs  to  be  evaluated  only  over  a finite- 
dimensional grid  to  specify  the  matrix  equation  that  needs  to  be  solved. 
In  both  of  these  cases  A Ar  has  a singularity  along  the  diagonal  as 
indicated  by  Eq.  (15). 

In  the  foregoing,  it  has  been  established  that,  depending  on  the  type 
of  data,  available,  inversion  of  [I  + HLL  H ] can  be  achieved  by  either 
solving  a Fredholm  integral  equation  or  a related  matrix  equation.  A 
number  of  potential  batch  and  recursive  processing  methods  to  conduct  this 
computation  are  developed  in  the  following  two  sections. 


Fig.  3 Domains  of  Definition  of  State  Covariance  Kernel 
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BATCH  SOLUTIONS 


Tlie  aim  of  this  section  is  to  convert  equations  (10)-(12),  which  are 
based  on  a somewhat  abstract  operator  notation,  into  a more  explicit  form 
that  more  directly  represents  the  computations  that  must  be  performed  in  a 
batch-processing  estimation  mode.  The  following  three  major  batch- 
processing alternatives  are  explored: 

o Direct  inversion  of  the  matrix  (I+HLL*H*),  a solution  primarily 
applicable  to  the  discrete-data  case  and  only  applicable  as  an 
approximation  to  the  continuous-data  problem 

o Expansion  of  the  estimates  in  terms  of  the  eigenfunctions  of 
R = HLL*H*  defined  by  R«f>^  = <)>  t with  being  the  eigenvalues 

0 Solutions  based  on  the  Fredholm  resolvent  G of  R defined  by 
(I+R)"l  = (I-G) 


Estimates  Based  on  Inversion  of  (I+HLL  H ) Matrix  for  Discrete  Data  Problem 


This  approach  involves  a direct  solution  to  the  matrix  equation  in 
(16)  which  results  initially  in  a set  of  observation  error  estimates  n„  = 
[n^,...,n^].  Then,  to  complete  the  estimation  process  involving  also  the 
state  estimate  u^  and  the  process  error  estimate  f^,  Eqs.  (10)  and  (12)  are 
converted  into 


N 

Uq(x)  = > r(xj,/x) 


(17) 

(18) 


where  ^ and  r are  as  before  the  Green's  function  for  A in  (1)  and  the  state 
covariance  kernel  in  (14). 

The  computations  required  by  this  sequence  of  equations  can  be 
summarized  as  follows.  First,  the  value  of  the  kernel  r (x^/^j„)  must  be 
determined  at  the  grid  points  in  Fig.  3(b)  corresponding  to  the  discrete 
spatial  locations  xj^  and  where  the  measurements  are  being  taken.  For 
complicated  structures,  obtaining  the  kernels  r and  ^requires  a non- 
trivial computation  that  is  best  achieved  by  using  the  finite-element 
method  [Ij.  After  r and  ^ are  determined,  the  matrix  inversion  required  to 
solve  (16)  must  be  conducted  to  arrive  at  the  observation  error  estimates 
n^.  Finally,  f^  and  u^  are  determined  by  (17)  and  (18)  based  on  the 
kernels  and  r.  Note  that  if  knowledge  of  f^  and  u^  is  required  at 
a point  X where  a measurement  is  not  available,  then  the  kernel s ®(xj^/x) 
and  r(xj./x)  must  also  be  evaluated  at  that  point  in  conducting  the 
computations  in  (17)  and  (18). 

One  of  the  main  advantages  of  the  above  approach  is  the  conceptual 
simplicity  of  the  required  computations.  However,  in  situations  with  many 
discrete  measurements,  the  required  matrix  inversion  may  limit  the 
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applicability  of  the  method  (although  this  limitation  may  be  alleviated 
somewhat  by  performing  the  inversion  in  an  off-line  processing  mode). 

While  the  approach  is  most  useful  for  the  discrete-data  problem^  it 
can  also  be  viewed  as  an  approximate  method  to  solve  the  case  with 
continuously  distributed  data.  With  continuous  data,  HLL*H  is  an  integral 
operator  and  the  equation  that  must  be  solved  is  an  integral  equation.  An 
approximate  solution  to  this  equation  could  consist  of  first  discretizing 
the  integral  equation  to  obtain  a corresponding  matrix  equation  and  then 
solving  the  matrix  equation. 

Although  the  approach  is  computationally  convenient,  it  generally  does 
not  lead  ^to  much  insight  and  understanding  about  the  problem  of  inverting 
(1+ HLL  H ) and  of  interpreting  the  resulting  state  and  model  error 
estimates.  A more  thorough  understanding  of  the  inversion  problem  is  best 
obtained  by  investigating  the  two  methods  outlined  below. 

Eigenfunction  Expansions  for  the  State  and  Model  Error  Estimates 

This  approach  is  generally  applicable  to  both  the  continuous  and 
discrete  problems,  although  minor  differences  may  exist  between  the  two 
problems.  For  both  of  these  cases,  the  eigenvectors  R = HLL  H are  defined 
as  the  nontrivial  solutions  of 


R = Ik  <t>k,  (19) 

where  corresponding  eigenvalues.  For  the  discete-data  problem, 

R is  a matrix,  and  there  is  a finite  number  of  finite-dimensional 
eigenvectors  <}>k* 

For  the  continuous-data  problem,  R is  an  integral  operator  (that  in 
most  practical  cases  satisfies  the  property  of  compactness  [3])  with  an 
infinite  number  of  function-space  eigenvectors  (])k.  With  R compact , the 
corresponding  eigenvalues  A?  0 as  k “.  Note  finally,  as  a point  of 
interest,  that  in  cases  with  H in  (2)  equal  to  the  identity  and  l/=L  self- 
adjoint,  then  X-k  in  (19)  are  also  the  eigenvectors  of  L and  L <t>  k “^k k* 

In  terms  of  the  process  error  and  state  estimates  can  be  expressed 
as 

f.  “ S *0 

k 

"o  = 5!  "o 

k 

where  the  'modal'  coefficients  f^  and  u^  are  given  by 


(20) 

(21) 
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/V 


(22) 


1 * xi 


yfc' 


pk  = 


and  yj^  = (<))^»y)3  represents  the  corresponding  modal  coefficient  of  the  data 

y- 

Note  that  while  (20)  and  (21)  constitute  expansions  for  f^  and  no 
attempt  has  been  made  so  far  to  obtain  a similar  expansion  for  the  related 
observation  error  estimate  n^.  In  general,  such  an  expansion  is  not 
possible  because  (in  somewhat  imprecise  mathematical  terms)  n^  can  exhibit 
a *wild'  spatial  behavior  that  cannot  adequately  be  represented  by  nicely 
behaved  functions  To  illustrate  the  possible  pitfalls,  consider 


% = 2 ‘*’k'  (23) 
k 

an  equation  that  in  the  same  spirit  as  (20)  and  (21)  attempts  to  expand  n^ 
in  terms  of  the  eigenvectors  Routine  manipulations  based  on  the 
integral  equation  (13)  can  be  used  to  show  that 


yk' 


(24) 


where,  as  before,  y^.  is  the  modal  coefficient  of  the  data  y. 

This  equation  can  be  used  to  show  the  problems  inherent  with  the 
attempted  expansion.  Recall  first  that  for  typical  cases  0 as  k->-“, 
and  the  coefficient  1/(1+Xp  in  (24)  approaches  unity.  This  implies  that 
unless  yj^  0 sufficiently  fast  (which  would  be  the  case  if  the  data  y is 
well-behaved),  then  n^  maybe  such  that  the  corresponding  expansion  (23) 
does  not  converge.  Consequently,  the  expansion  (23)  must  be  used  with  care 
because  its  validity  is  guaranteed  only  in  cases  where  the  data  y is  well- 
behaved.  Note  parenthetically  that  in  the  discrete-data  case,  the  number 
of  eigenvalues  is  finite  and  the  above  convergence  problem  does  not  occur. 

One  way  to  avoid  the  convergence  problem  is  to  express  n^j  as 


“o  = y “ '^o' 


(25) 


where  the  state  estimate  u^  can  be  obtained  from  (21).  This  expression, 
which  does  not  involve  a direct  expansion  of  n^  as  in  (23),  provides  a more 
reliable  means  to  compute  the  observation  error  estimate  n^. 
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Expressions  for  the  Estimates  Based  on  the  Fredholm  Resolvent 


It  has  been  established  in  the  previous  section  that  inversion  of  the 
operator  (I  + R)  is  equivalent  to  the  solution  of  the  Fredhom  integral 
equation  (13).  If  the  Fredholm  resolvent  G defined  by  (I+R)”^  = (I~G)  is 
introduced  in  conducting  this  inversion,  then  the  solution  to  the  integral 
equation  is: 


" / g(x/4)y(4)d§,  (26) 

an  equation  that  determines  the  observation  error  estimate  in  terms  of  the 
data  y and  of  the  yet  to  be  determined  Fredholm  resolvent  kernel  g.  If  for 
the  moment  it  is  assumed  that  g is  available  (see  below  for  methods  that 
can  be  used  for  its  computation),  then  (26)  is  all  that  is  needed  to 
compute  the  observation  error  estimates.  The  related  estimates  f^^  and  u^ 
can  be  computed  by  means  of 


(27) 

(28) 

which  are  the  continuous-data  versions  of  the  discrete-data  equations  (17) 
and  (18). 

A useful  alternative  method  for  computation  of  u^  (for  the  case*  where 
H=I)  is  to  use 


fo(x) 


Uq(x)  = / r(x/5)nQ(4)d^, 


’^2 


Uq(x)  = / g(x/Oy(§)d^, 

J n 


(29) 


an  equation  that  can  be  readily  established  by  the  following  sequence  of 
mathematical  steps:  u^  = LL'*’H*(I+HLL*H*)’'^y  = LL*(I+LL*)~^y  = R(I+R)“^y  = 
Gy.  Because  it  provides  a direct  solution  for  the  state  estimate  u^  in 
terms  of  the  data,  this  equation  provides  a preferrable  alternative  to  the 
indirect  computation  of  u^  using  (26)  and  (28).  However,  as  noted  above, 
(29)  is  limited  to  cases  where  the  observations  are  available  throughout 
the  entire  spatial  domain. 


An  important  element  of  the  expressions  outlined  above  is  the  Fredholm 
resolvent  which  must  be  determined  before  computations  based  on  (26)  can  be 
carried  out.  Some  remarks  about  how  to  achieve  this  computation  are 
outlined  below.  Note  first  that  (I+R)  ^=(I-G)  implies  that  R=G+RG  which  in 
terms  of  the  corresponding  integral  operator  kernels  r and  g becomes 


r(x/^) 


g(x/§) 


r(x/q)g(q/^)dTi. 


(30) 
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This  represents  an  integral  equation  for  the  unknown  resolvent  kernel  g 
with  the  known  state  covariance  kernel  r given  by  (14).  One  approach  to 
its  solution  is  to  otain  an  expansion  in  terms  of  the  eigenfunctions  of 
R in  (19).  To  this  end,  it  is  useful  to  first  recall  the  following  Mercer 
expansion  [3]  for  the  kernel  r 

kj  <l>jj.(x)  ‘!>j.(5).  (31) 

k 

which  can  be  obtained  routinely  from  the  integral  representation  for  r in 
(14)  or,  equivalently,  from  the  boundary-value  problem  in  (15). 
Substitution  of  (31)  in  (3  0)  leads  to  the  following  expansion  for  the 
Fredholm  resolvent  kernel  g 


g(x/5)  = 2.  — ; ^32) 

k l+k£ 

This  summation  constitutes  the  desired  solution  g of  (30).  Note  that  the 
summation  is  very  well-behaved  because  as  k-><»,k^->0  and  consequently 

kj/(l+kj)  0. 

The  above  method  of  solution  applies  primarily  in  cases  where  all  of 
the  data  is  processed  simultaneously  in  a batch-processing  mode.  In  the 
following  section,  an  alternative  (sequential)  method  for  computation  of 
the  Fredholm  resolvent  will  be  used  to  develop,  in  certain  restricted 
geometries  and  coordinate  systems,  a class  of  scanning  algorithms  where 
recursive  processing  of  the  data  is  used  to  generate  the  desired  state  and 
model  error  estimates. 

5.  SCANNING  SOLOTIONS  FOR  CIRCULAR  REGIONS 

The  scanning  algorithms  developed  here  provide  an  interesting 
alternative  to  the  batch  solutions  just  presented  because  of  two 
fundamental  reasons:  1)  they  have  the  potential  for  avoiding  inversion  of 

large-dimension  matrices,  as  is  necessary  if  many  measurements  are 
available,  and  2)  they  solve  the  problem  of  how  to  process  additional 
data  (for  instance,  the  N+l  measurement)  after  estimates  based  on  a fixed 
data-set  with  N measurements  have  been  determined.  Additional  motivation 
for  investigating  the  scanning  solutions,  of  course,  is  due  to  the 
possibility  of  gaining  further  insight  into  the  estimation  problems  which 
constitute  the  central  theme  of  this  paper. 


Definition  of  Simple  PDE  Model  to  Illustrate  the  Scanning  Solutions 


Consider  a simple  PDE  model  consisting  of  V^u=f  defined  over  the 
circular  region  Q = {(p,0):  0 < p < P,  0 s 0 £ 2n}  with  the  free-boundary 
condition  9u(R,0)/dr=O  at  the  edge  of  the  region  and  the  internal  condition 
u(O,0)=O  at  the  origin.  The  observation  equation,  illustrated  in  Fig.  1, 
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y(0)=u(P,©)  + n(0) 


0 < 0 < T, 


(33) 


is  based  on  tbe  assumption  that  data  is  available  only  over  the  circular 
arc  r (T)  = {(p,0)s  p=R,  O^0^T}.  Partial  boundary  data  (as  opposed  to 
data  over  the  entire  circumference)  is  assumed  because  the  estimation 
problem  will  be  solved  first  with  a fixed  and  prescribed  data  interval 
0-0  — T.  Then,  the  scanning  algorithms  will  be  developed  by  examining  the 
dependence  of  the  fixed-interval  solution  on  incremental  changes  of  the 
upper  limit  of  observation  T. 

The  Fredholm  Integral  Equation  With  Partial  Boundary  Data 

It  has  been  shown  that  the  integral  equation  (13)  is  central  to  the 
estimation  schemes  under  development  in  this  paper.  For  the  particular 
example  involving  the  circular  region  described  above,  this  equation 
assumes  the  following  form: 


n„(Ti6) 


=y(«)  for  OsesT. 


where  the  dependence  of  the  observation  error  estimate  n^  on  T has  been 
indicated  by  the  explicit  notation  n^(T^0)„  As  before,  one  of  the  key 
features  of  this  integral  equation  is  the  kernel  r(R,0;R,<jj)  which  for 
this  particular  problem  has  the  following  interesting  properties. 


For  the  example  under  study  here,  the  partial  differential  equation 
for  the  state  covariance  kernel  becomes 


v^r(p,0;q,^)  = p“^6(p-q)8(0-  ijj)  , 


where  = 9^/3p^  + p”^9/90  + p”^9^/90^  is  the  Laplacian  operator  in  polar 
coordinates.  In  order  to  compute  r,  this  equation  must  be  solved  with 
both  r and  V ^r  simultaneously  satisfying  the  free  boundary  conditions  and 
the  internal  conditions  specified  for  the  original  problem  in  the  previous 
subsection.  Note  parenthetically  that  the  kernel  r(p,0;q,  i|;)  corresponds  to 
the  solution  of  (35)  evaluated  at  an  arbitrary  point  v/ith  polar  coordinates 
(p,0)  under  the  assumption  that  the  forcing  term  represented  by  a 'delta" 
function  is  applied  at  the  noncoincident  point  (q»i(j).  The  solution  of  (35) 
is  very  conveniently  obtained  using  a trigonometric  expansion  along  the 
0 direotiom  Use  of  such  an  expansion  for  x(p,%q,ip)  and  evaluation  along 
the  radial  boundary  p=q=R  results  in  the  following  expression 

R^  sp  cos  k (0-  4^ ) 

r(R,0;R,i|j)  = ~ / (36) 

2n  ^ k^'(kil) 
k 
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This  provides  a trigonometric  expansion  for  the  kernel  involved  in  the 
integral  equation  (34).  Inspection  of  the  above  expansion  reveals  that  the 
kernel  r is  stationary,  periodic  and  irrational  as  discussed  in  more  detail 
below. 


The  kernel  r is  azimuthallv  stationary  in  the  sense  that  it  is  a 
function  of  the  difference  ^=6-  of  its  two  azimuthal  arguments 
6 and  ip  . Hence,  the  simplified  notation  = r(R,6;R,i|;)  can  be  used, 
and  the  resulting  function  investigated.  Note  from  (36)  that  when 
r^(p)  is  viewed  as  a function  of  p,  it  can  be  interpreted  as  a correlation 
function  with  symmetry  r^(p)=r^(-p)  about  the  origin  p=0,  and  with  its 
maximum  value  3Cq(0)  being  achieved  at  P=0.  The  kernel  Iq(P)  is  the 
correlation  r (p)  = E [u(R;O)u(R,0+p)  ] of  the  boundary  values  u(R,6)  and 
u(R,6'i-p)  of  the  displacement  u,  under  the  assumption  that  the  actual 
process  error  is  spatially  distributed  white  noise  (see  Sec.  7 of  this 
paper  for  a more  complete  discussion  of  this  interpretation). 

Stationarity  of  the  kernel  Iq(P)  is  closely  related  to  the  circular 
symmetry  and  homogeneity  of  the  model  equations  and  of  the  circular  domain. 
Usually,  kernels  that  possess  this  property  lead  to  special  cases  of  the 
integral  equation  (34)  that  are  easier  to  solve  than  the  general  case  in 
which  stationarity  of  the  kernel  does  not  exist.  One  of  the  interesting 
areas  requiring  further  investigation  involves  the  development  of  solutions 
to  (34)  that  make  optimum  use  of  the  stationarity  of  the  kernel.  However, 
these  solutions  are  not  yet  available.  The  scanning  algorithms  to  be 
developed  in  a subsequent  subsection  do  not  make  use  of  this  property. 
Consequently,  such  algorithms  may  not  be  the  most  efficient  solutions  in 
cases  where  stationarity  holds.  On  the  other  hand,  however,  their  range  of 
applicability  is  much  broader  since  they  apply  to  general  types  of  kernels 
without  being  constrained  to  integral  equations  where  the  stationarity 
condition  is  satisfied. 

An  additional  property  of  the  kernel  r^(p)  is  that  it  is  irrat ional. 
in  the  usual  sense  that  its  double-sided  Laplace  transform 
r^(s)=  / exp(-sp)rQ(p)dp  is  not  a ratio  of  finite-order  polynomials  in 
s.  This  property  is  a direct  consequence  of  the  infinite  dimensionality  of 
the  mapping  u=HLf  from  f(*,*) — ►-n(R,  •).  The  main  implication  of  this 
property  is  that  the  scanning  solutions  to  be  developed  will  necessarily 
involve  infinite-dimensional  models  (which  in  the  end  may  nonetheless  have 
to  be  approximated  by  finite-dimensional  computations). 

The  final  major  condition  of  the  kernel  is  its  azimuthal  periodicity 
r (p)=rp(p+2n).  This  property  will  turn  out  to  have  an  important  impact  on 
the  resulting  scanning  algorithms. 

A more  detailed  investigation  of  the  foregoing  stationarity, 
periodicity,  and  irrationality  properties  of  the  kernel  rp(p),  as  well  as 
the  development  of  corresponding  estimation  schemes  that  make  optimum  use 
of  these  properties,  is  currently  under  way  but  definitive  results  are  not 
yet  available.  For  now,  the  paper  turns  to  the  development  of  scanning 
solutions  where  such  properties  are  not  fully  exploited,  with  the 
understanding  that  the  more  refined  schemes  are  yet  to  come. 
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A Partial  Differential  Equation  for  the  Fredholm  Resolvent 


In  the  particular  example  under  consideration  here,  the  integral 
equation  (30)  for  the  Fredholm  resolvent  g becomes 


r^^(e-<|>  )=g(Ti0,i|^) 


r^(0-Tl)g(T;q,iJ^)dii, 


(37) 


where  the  notation  g(T;0,ijj  ) indicates  explicitly  the  dependence  of  g on 
the  upper  limit  of  observation  T.  The  immediate  aim  here  is  to  use  (37)  to 
develop  the  celebrated  [4]  Krein-Bellman  equation 


3g(Tj0,  ip)/dT  i-  g(TfO,T)g(TiT,ip)  = 0. 


(38) 


by  means  of  the  following  sequence  of  manipulations.  Differentiation  of 
(37)  with  respect  to  T leads  to 


0 = 


3 8 

— - (T;0.  >1^  ) 
3T 


rQ(0-ri) 


3T 


(T;n,4^)dti  + rQ(0-T)g(T;T,<J^) 


(39) 


Evaluation  of  (37)  at  =T  implies  that  the  last  term  in  (39)  is  given  by 
ry(0-T)g(T|T,4))=g(Tj0,T)g(TfT.t(^)  + J r^(e~ii)g(T;r}.T)g(TsT,  ^ )dri.  (40) 


and  substitution  of  (40)  in  (39)  leads  to 


T 

ro(0“'H)0(Tiq,iji)d'n,  (41) 

0 

where  Q(T|0,  )=a  g(T;©,  ) / dT  + g (T;  0,T)  g (Tj  T,  ijj ) . Since  (I+R)  is 

nonsingular,  (41)  implies  that  0=0  thereby  establishing  the  identity  (38). 
This  is  the  central  result  needed  to  establish  the  Gohberg-Krein 
factorization  for  this  problem  as  outlined  below. 


0 = Q(T;0,  ijj  ) 


Gohberit-Krein  Factorization 

Tlie  factorization  states  that 


(I  + HLL*H*)"^  = (I  a-E£^), 


(42) 
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where  ES^&nd  SB  H are  the  mutually  adjoint  operators 


ESBj 


^Ve  = 


g(t;t,T)y(r)dT  , 


g(t;T,t)e(T)dt, 


(43) 

(44) 


This  result  can  be  established  by  first  computing  SB  H ESB  from  (43)  and 
(44)  and  then  using  (38)  to  obtain  the  identity 


G=ESB*  <B*E*  -SB*'^ESB,  (45) 


that  relates  the  Fredholm  resolvent  G and  the  Volterra  operators  H^and 
SB  H*.  This  equation  implies  that  [I+HLL*H*]~^  = (I-G)  = I-H^  SB*Y^ 
+ BB*1^ESB  - (I-  SB*E  ) (1-ESB),  which  is  the  desired  result. 

The  State  and  Model  Error  Estimates 

A direct  implication  of  (42)  is  that  the  model  error  estimate 
eo=[fo*“o^  can  be  expressed  as 

eo=h*(I-  ^*H*)(I-H^)y,  (46) 

or  more  explicitly  as  n^  = (I-.^"*!!*)  (I-E,^)y  and  fQ=L*H*ng.  This  equation 
defines  n^  (and  f^  as  well)  in  terms  of  a linear  transformation  operating 
on  the  data  y.  This  overall  transformation  can  be  viewed  as  the  product  of 
the  two  terms  (1- .0?  H ) and  (I-H^)  which  specify  a two-stage  scheme  for 
processing  of  the  data  y.  The  first  stage,  corresponding  to  (I-H^),  leads 
to  a 'filtered'  estimate  y together  with  a residual  (innovations) 
process  e=(I-H^)y.  The^residual  process  e is  then  operated  upon  by  the 
second  factor  (I-  H ) to  obtain  the  desired  observation  error 

4t  4t 

H )e.  The  process  error  f^  and  the  state  estimates  u^ 
corresponding  to  n can  be  obtained  by  means  of  the  equations  (10)  and 
(12). 


An  illustration  of  the  foregoing  two-stage  process  defined  by  the 
consecutive  transformations  (I-Hi^)  and  (1~SB  H ) in  (46)  is  shown  in  Fig. 
4.  The  sketch  on  the  left  corresponds  to  the  operator  (I-ES^),  whereas  the 
one  on  the  right  illustrates  its  adjoint  il-SC^^E  ).  The  clockwise 
operation  on  the  left  leads  first  to  an  estimate  H ^y  and  then  to  an 
innovations  process  e=(I-HiZ')y.  The  innovations  process  e is  obtained  by 
substracting  the  observed-state  estimate  (in  this  particular  case,  the 
value  of  the  state  estimate  at  the  boundary)  from  the  boundary  data  y.  The 
innovations  process  e,  defined  by  e=(I-ESB)y,  is  then  used  in  the  counter- 
clockwise stage,  illustrated  on  the  right-side  sketch,  to  obtain  the 
desired  observation  error  estimate  n^. 
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(I -Hit)  RESIDUAL 


Figure  4.  Scanning  Solutions  Based  on  Azimuthal  Sweeps 
Computation  of  Volterra  Kernels 

The  one  missing  ingredient  in  the  forward-backward  azimuthal  sweep 
just  described  is  the  specification  of  the  kernels  g(t^t,r)  and  g(tjn:,t)  in 
and  H in  (43)  and  (44).  For  causal  time-evolving  systems,  it  is 
known  [2]  that  the  kernels  can  be  computed  as  g(t,:t,T)  = (t,T)p(r)H*  and 

g(t;Tr„t)=Hp(t)  $^(T;,t)H*,  where  $ is  the  transition  matrix  for  a Kalman 
filter,  and  p is  the  solution  to  the  related  Ricatti  equation. 
Unfortunately,  such  general  results  for  computation  of  the  kernels  are  not 
yet  available  for  the  class  of  elliptic  systems  under  investigation  here, 
where  the  causal  structure  generated  by  the  azimuthal  sweep  is  introduced 
somewhat  artificially.  Analysis  to  date  indicates  that,  contrary  to  the 
time- evolving  causal  case,  the  kernels  cannot  be  generated  by  means  of  a 
simple  Kalman  filter  (even  one  of  infinite  dimensionality).  The  main 
reason  for  this  lack  of  simplicity  is  the  periodicity  of  the  kernel  r^(p) 
in  (36).  However,  the  kernels  g(tit,t)  and  g(t|T,t)  in  (43)  and  (44)  are 
well-defined  functions  of  the  running  variable  r and  the  upper  limit  of 
observation  t.  Hence,  even  though  they  most  likely  cannot  be  generated  by 
means  of  straight-forward  Kalman  filtering,  alternative  methods  for  their 
computation  should  be  effective.  Such  methods  are  currently  under 
investigation. 

6.  RELATIONSHIPS  BETWEEN  ACTUAI,  AND  ESTIMATED  ERRORS 

One  of  the  key  results  of  the  previous  sections  is  that  in  equation 
(9),  BQ=h*(hh*)~^y,  which  relates  the  model  error  estimates  to  the 
available  data  y.  By  substitution  of  (7)  in  (9),  it  is  possible  to  go  one 
step  beyond  and  obtain  the  following  relationship  between  the  actual  and 
estimated  errors: 


= We,  (47) 

where  W=h  (hh  )’’’*'h  is  a linear  transformation  whose  mathematical  properties 
are  important  to  understand. 

Result  6.1  W is  a self-adioint  projection  operator.  This  result  can  be 
established  readily  by  the  following  sequence  of  statements  = 

[h*(hh*)~^h][h*(hh*)"^h]  = h*(hh*)“^h  = W.  Hence,  W^=W  and  W is  a 

projection  operator.  Self-adjointness  of  W can  be  established  by 
inspection  of  the  definition  W = h*(hh*)“^h. 
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Result  6.2  (I-W)  is  a self-adioint  proiection  operator.  Note  that  (I-W)^ 

= (I-2WtW^)=(I-W) , thereby  establishing  the  result. 

Define  now  the  model-error  estimation  error  8=6-8^  as  the  difference 
between  actual  and  estimated  errors.  Since  8q=Ws,  then  8p=e-We  and 


8p  = (I-W) 8, 


(48) 


an  equation  that  relates  the  model-error  estimation  error  and  the  actual 
error  e. 

Result  6.3  and  are  orthogonal  complements.  This  result  follows 
immediately  from  = <We,(I-W)e>  = <8,(W-W^)s>  = 0.  This  last  result 

is  extremely  interesting  because  it  establishes  a decomposition  of  the 
actual  error  vector  e as  a sum  of  orthogonal  complements  which  can  be 
interpreted  geometrically  as  outlined  below. 

Geometrical  Interpretation 

The  horizontal  axis  in  the  figure  denotes  conceptually  the  process 
error  f,  while  the  vertical  axis  corresponds  to  the  observation  error  n. 
These  two  variables  comprise  the  two-component  model  error  vector  8=[f»n]. 
Consequently,  an  arbitrary  error  vector  can  be  represented  as  a point  on 
the  two-dimensional  surface  shown  in  the  figure.  Those  pairs  e=[f,n]  which 
satisfy  the  system  model  equation  (7)  define  conceptually  a straight-line 
segment  characterized  by  the  relation  he=y.  The  observation  y is 
represented  as  a single  point  appearing  at  a prescribed  location  along  the 
vertical  axis.  The  minimum  norm  problem  in  (6)  and  (7)  can  be  thought  of 
as  that  of  finding  the  shortest  vector  8^  from  the  origin  to  the  straight- 
line  segment  hs=y.  The  corresponding  minimum  distance  ^ equals 

the  norm  of  the  minimum  error  vector  8Q=[fp,nQ]  representing  the  model 
error  estimates.  By  simple  geometry  (see  Fig.  5),  the  shortest  vector 
joining  a point  and  a straight  line  is  orthogonal  to  the  line.  This  also 
implies  that  an  arbitrary  actual  error  vector  e (one  that  satisfies  h8=y, 
but  whose  norm  may  not  be  minimum)  can  be  decomposed  as  the  sum  of  the 
minimal  error  vector  e^  and  its  orthogonal  complement  Cp, i.e.. 


OBSERVATION 
i ERROR 


€ = ACTUAL 

= OPTIMAL 

= ORTHOGONAL 
P COMPLEMENT 


CONSTRAINT  h € = y 


f PROCESS  ERROR 


Fig.  5 The  Actual  Error  Vector  as  a Sum  of  Orthogonal  Complements 
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B=e  + 8^ 
o p 


(49) 


where  <eQ,8p>=0,  The  square  of  the  'length'  of  the  vector  e of  non-minimum 
norm  can  be  expressed  as 


<e,e>  = <eQ.e^>  + <8^,8^), 


(50) 


an  equation  that  can  be  interpreted  as  a mathematical  statement  of  the 
ancient  geometrical  theorem  that  the  square  of  the  length  of  the  hypotenuse 
of  a right  triangle  equals  the  sum  of  the  squares  of  its  sides. 

Result  6.4  Azimuthal  factorization  of  W in  circular  regions.  The  objective 
here  is  to  show  that,  in  cases  where  the  Gohberg-Krein  factorization  (42) 
holds,  W can  be  factored  as  the  product 'W=w  w of  the  yet  to  be  defined 
operator  w and  its  adjoint  w . 

To  establish  this  result,  it  is  convenient  to  begin  by  recalling  that 
by  definition  W=h*(hh*)“^h  = h*(I+ HLL*H*)“^h.  Use  of  the  Gohberg-Krein 
factorization  in  this  equation  implies  that 


W=w*w, 


(51) 


where  w is  the  operator  w«(I-Hi2’)h.  A block  diagram  illustrating  this 
factorization  is  shown  in  Fig.  6.  The  diagram  shows  the  two  major  |tages  w 
and  w that  together  constitute  the  overall  transformation  W=w  w.  The 
first  stage  w=(I-H^)h,  shown  on  the  left  portion  of  the  diagram,  maps  the 
actual  error  vector  e=[f,n]  into  the  residual  process  e=we.  This  first 
stage  characterized  by  w involves  first  the  generation  of  the  data  y by 
means  of  the  equation  y=he=HLf+n  followed  by  the  use  of  this  data  to 
generate  the  re|idual  process  e=(I-H^)y.  The  second  stage,  represented  by 
the  operator  w , operates^on  the  residual  progess  in  order  to  obtain  the 
model  error  estimates  Sq=w  e.  Because  w and  w are  mutually  adjoint,  the 
first  and  second  stages  can  be  viewed  as  being  mirror  images  of  each  other, 
with  respect  to  the  dotted  vertical  line  in  the  diagram  representing  the 
innovations  process. 


INNOVATIONS 


Fig.  6 Factorization  of  W 

Another  very  interesting  result  is  that  if  the  above  two-stage  process 
is  reversed,  and  w*  is  performed  before  w,  then  the  identify  transformation 
results. 
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Result  6.5  ww*=I.  This  identity  can  be  established  by  the  following 
sequence  of  operations  ww*=(I-H^)hh*(I-^*H*)=(I-H_S?) (1+ HLL*H*) 
(I-^V)=I. 

Result  6.6  The  ’lengths*  of  the  minimal  error  vector  and  of  the  innova- 
tions process  are  equal.  This  result  follows  from  <8(jtep>=<we,we  >=(e,w*we>3 
which  implies  that 


<®o'®o>  = (e,e)3. 


(52) 


Both  of  these  last  two  results  are  intimately  related  to  the 
'whiteness'  of  the  innovations  process  in  cases  where  the  actual  errors  e 
are  assumed  to  be  spatially  distributed  white-noise,  as  will  be  discussed 
in  more  detail  in  Sec.  7. 

An  Eigenfunction  Expansion  for  W 

Although  an  eigenfunction  expansion  is  possible  in  both  the  continuous 
and  discrete  cases,  only  the  continuous-data  problem  is  discussed  here, 
because  the  notation  is  much  simpler  for  this  problem.  For  the  continuous- 
data  problem,  the  transformation  W can  be  viewed  as  a mapping  from  the 
product  space  L2(fi)  x L2((2)  into  itself.  The  transformation  can  be  more 
explicitly  characterized  by  the  following  set  of  equations 

fo<x)  = / W^;^(x/Uf(Ud?  + / W2i(x/On(Ud^,  (53) 

Jo,  JQ 


no(x) 


W2i(x/^)f(Ud§  + n(x)-  / W22(x/Un(Od? 


-/»2 


(54) 


where  W ^ . are  integral  operator  kernels  that  characterize  the 
transformation.  By  means  of  routine  manipulations,  it  can  be  shown  that 
these  kernels  can  be  expressed  as 


Wii(x/U  = W22(x/U 


= 2 


1+X: 


(55) 


= W2i(x/?) 


uxl 


‘#>k(x) 


(56) 


in  terms  of  the  eigenfunctions  <|>|^  of  the  operator  R in  (19). 
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Reorientation 


Tliis  concludes  the  development  of  a number  of  important  relationships 
between  actual  and  estimated  errors.  In  the  following  section,  these 
relationships  will  be  shown  to  be  fundamental  to  the  study  of  a 
probabilistic  interpretation  of  the  deterministic  least-squares  estimates 
previously  described. 

7.  COVARIANCE  ANALYSIS  WITH  WHITE-NOISE  MODEL  ERRORS 

Tlie  estimation  approach  presented  so  far  is  based  on  the  principle  of 
least-squares,  which  involves  the  deterministic  minimization  of  a quadratic 
functional  of  the  model  errors.  This  deterministic  approach  does  not 
require  that  probabilistic  notions  be  introduced  in  order  to  derive  the 
state  and  model  error  estimates.  Equations  (10)-(12)  can  be  viewed  as 
establishing  a totally  deterministic  rule  by  which  the  data  may  be 
processed  to  obtain  the  desired  estimates.  Here,  the  objective  is  to 
explore  a probabilistic  interpretation  of  the  estimates.  This  is  done 
under  the  assumption,  not  previously  used  in  the  paper,  that  the  actual 
errors  possess  the  properties  of  a yet  to  be  defined  spatially  distributed 
white-noise  process. 

Tbe  specific  objectives  of  the  section  are; 

o to  investigate  the  second-order  statistical  properties  (the 
covariance  primarily,  since  all  processes  are  zero-mean)  of  the 
state  and  model  error  estimates  Uj,  and  and  of  the  corresponding 
state  estimation  error  ^p=u-Up  and  model-error  estimation  error 

o To  obtain  explicit  expressions  for  the  covariance  of  the  estimation 
errors  Up  and  Sp  and  to  obtain  related  expressions  for  the  mean- 
square  es^timation  errors  E(Up,Up)  and  E ^ep,Sp>. 

o To  show  that  the  state  and  model  error  estimates  u^  and  can  be 
interpreted  probabilistically  as  the  conditional  expectations 
8Q=E(e/y)  and  u^j=E(u/y). 


Since  these  results  are  based  on  the  assumption  that  the  actual  errors 
e are  spatially  distributed  white-noise,  it  is  of  interest  to  first  make 
somewhat  more  precise  the  notion  of  such  a process. 

Definition  of  a Spatially  Distributed  White-Noise  Process 

Intuitively  (see  [3]  for  a more  precise  definition),  a spatially 
distributed  white-noise  process  f is  characterized  by  an  impulsive  'delta- 
funct.ion'  covariance,  i.e.. 


E[f(x)f(4)]  = 6(x-§). 


(57) 


567 


This  definition,  which  is  in  the  same  spirit  as  that  used  for  time- 
dependent  causal  systems,  implies  that  the  values  f(x)  and  f(4)  of  the 
process  at  the  two  distinct  points  x and  ^ are  totally  uncorrelated.  The 
definition  also  reflects  the  fact  that,  if  the  points  x and  ^ are 
coincident,  the  correlation  becomes  infinite.  Based  on  this  definition,  it 
is  possible  to  state  that,  if  e-[f,n]  are  assumed  to  be  white-noise,  then 
the  corresponding  covariance  is 


E[8(x)8^(U]  = U8(x-U» 


(58) 


where  U is  an  appropriately  dimensioned  unit  matrix  that  accounts  for  the 
possibility  that  e(x)  may  be  a vector  of  a finite  number  of  dimensions. 

The  foregoing  is  only  a formal  definition  of  the  covariance  of 
spatially  distributed  white-noise,  because  it  attempts  to  define  in  a 
pointwise  sense  a quantity  (the  spatial  covariance)  that  cannot  be  defined 
precisely  in  this  sense.  A somewhat  more  rigorous  notion  is  that  of  a 
covariance  operator  for  a white-noise  process  as  defined  below. 

Definition  of  the  Covariance  Operator  of  a White-Noise  Process 

In  approximate  terms,  the  covariance  operator  is  the  'integrated' 
version  of  the  covariance  function  in  (57).  For  example,  the  covariance 
operator  E[ee  ] for  the  process  e is  defined  to  be  that  integral  operator 
whose  kernel  is  the  covariance  function  in  (58).  Since  the  covariance 
function  of  white-noise  is  impulsive,  the  corresponding  covariance  operator 
is  the  identity.  This  can  be  observed  from  the  following  sequence  of 
operations:  E[ ec*]  v=  jjE[ 8 (x) e(5) ] v(4)d^=v,  which  is  valid  for  all 
admissible  functions  v.  Consequently,  the  covariance  operator  E(88  ) of  8 
satisfies  the  condition 


E(ee*)=I,  (59) 


where  I is  the  appropriately  dimensioned  identity  operator.  Sq.  (59), 
which  can  be  viewed  as  the  integrated  version  of  (58),  represents  the  key 
assumption  needed  to  compute  the  corresponding  covariances  of  the  estimates 
and  of  the  related  estimation  errors. 

The  Covariance  Operator  E[uu*3  of  toe  State  u.  The  following  sequence  of 
steps  ECuu*]  = ECLff^'L*]  = LE[ff*]L*  = LL*  imply  that 


E[uu*]  = LL*. 


(60) 


Note  that  the  kernel  of  LL  is  r(x/§)  implying  that 
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E[u(x)u(4)]  = r(x/?) 


(61) 


whicli  justifies  the  earlier  interpretation  of  r as  the  state  covariance 
kerne  1 . 

The  Covariance  Operator  E[yy*]  of  the  Data  y.  This  operator  is  given  by 

E[yy*]  = I t HLL*H*,  (62) 

as  can  be  established  by  E[yy*]=E[h8e*h*]=hE[8e*]h*=hh*=I+HLL*H*. 

The  Covariance  Operators  E[e^8*]  and  E[8j,e*]  of  8^  and  8^. 
The  following  sequence  of  steps  E[e^j8*  ]=E[Wee*W*]=WE[88*]  W*=W^=W  and 
E[e^e*]=E[(I-W)ee*(I-W)]=(I-W)E[88*](I-W)=(I-W)  imply  that 

Jr  ir 


E[8o8;1=W, 

(63) 

E[ep8p]=I-W, 

(64) 

where  W is  the  projection  operator  defined  in  (47)  that  relates  the  actual 
and  estimated  errors  8 and  8^. 

Whiteness  of  the  Innovations  Process.  This  result 

E(ee*)  = I,  (65) 

valid  in  cases  where  the  Gohberg-Krein  factorization  holds,  follows  from 

4^  4^4^  4^ 

the  following  sequence  of  steps  E(ee  )=E[wc8  w ]=ww  = I. 

The  Covariance  Operator  E[UpU*]  of  the  State  Estimation  Error  Up»  The 
state  estimation  error  Up  is  defined  as  the  difference  Up^u-u^  of  the 
actual  and  estimated  state  u and  u^^.  Since  u=Lf  and  then 

Up=L( f-f^)=Lf p,  where  fp  is  the  process-error  estimation  error  that 
together  with  np  forms  the  two-component  model-error  estimation  error 
8p=[fp,np].  Hence,  E[UpU*]=LE[fpfpIL*  and 

E|[UpuJ]=L[I-L*H*(I+HLL*H*)“lHLl  L*,  (66) 
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where  Etfpfp]  has  been  computed  by  taking  that  component  of  E[  SpeJ]  in  (64) 
that  corresponds  to  E[fpfp], 

An  alternative  expression  for  E[u  u*]  can  be  found  by  rearranging  (66) 
to  obtain 


E[UpU*]=L(I+L*H*HL)"^L*.  (67) 

Note  that  (66)  requires  inversion  of  the  operator  (I'i’HLL  H ),  whereas  (67) 
involves  inversion  of  the  related  but  distinctly  different  operator 
(I+L*H*HL).  The  availability  of  the  two  options  in  (66)  and  (67)  provides 
the  possibility  of  using  whichever  option  turns  out  to  be  simplest  in  a 
given  situation. 

The  Mean-Square  State  Estimation  Error  E(Up,Up).  The  mean-square  state 

estimation  error,  defined  as  E(n_,u_),  is  related  to  the  trace  of  the 

4i  P P 

covariance  operator  E[u  u ] by  means  of  the  following  identity 

Jr  Jr 


E(Up,Up)=Tr  E[UpUp],  (68) 

where  the  trace  Tr  of  an  operator  is  defined  in  the  sense  of  Ref,  [3]. 
Substitution  of  (67)  in  (68)  leads  to  the  following  expression  for  the 
mean-square  state  estimation  error 

E(Up, Up)=Tr [L(I+  L*H*HL)~^L* ] (69) 

An  alternative  expression  for  E(n  ,u  ),  of  course,  can  be  obtained  by 
substitution  of  (66)  in  (68). 

An  eigenfunction  expansion  can  also  be  found  for  E(Up,Up)  in  terms  of 
the  orthonormal  basis  in  (19).  For  simplicity,  the  assumption  is  made 
that  H’=E*=I,  corresponding  to  the  case  with  observations  over  the  entire 
domain.  For  this  case,  it  can  readily  be  shown  that 

E(Up,Up)=  ^ A.J/(l+l|),  (70) 

k 

2 ^ 

where  are  the  eigenvalues  of  ^LL  . 

The  Mea^-Square  Model-Error  Estimation  Error  E<ep,8p>.  Since  ^^®p»®p>~ 
TrE[Sp8p],  Eq.  (64)  implies  that 

E<ep,8p>=Tr(I-W),  (71) 
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or,  in  terms  of  the  components  fp  and  np  of  6p, 


E<Sp.ep>=E(fp,fp)  + E(np,np), 

(72) 

E(  J;_ , f „)  =Tr  [ I-L*H*  ( 1+  HLL*H* ) “^HL]  , 

Mr  Mr 

(73) 

E(np,np)=Tr[I-(I+HLL*H*)-l] , 

(74) 

The  last  two  terms  E(f„,f^)  and  E(n^,n„)  are  the  mean-square  estimation 

P P P P 

errors  inherent  in  the  process-error  and  observation-error  estimates.  Mote 
parenthetically  that  by  use  of  the  identity  (I+HLL  H ) -I-G,  where  G is 
the  Fredholm  resolvent  in  (26),  the  condition  (74)  can  be  converted  to 


E(m  ,n  )=Tr[G]  = / Tr[g(x/x)]dx  (75) 


an  equation  which  is  somewhat  more  convenient  than  (74). 

Eigenfunction  expansions  for  E(f  ,f  ) and  E(n  ,n_)  can  also  be  found  as 

ir  Mr  MT  Mr 


E(fp,fp)=  ^ 

> l/(l+kj). 

(76) 

k 

E(ttp.hp)“  ^ 

kj/(l+kj). 

(77) 

k 


where,  for  simplicity,  it  has  been  assiuned  that  H=H*=I  in  (73)  and  (74). 
Unbonndedness  of  the  Process-Error  Mean-Sauare  Estimation  Error 

Under  the  assumption  used  in  this  section  that  the  actual  errors  e 
are  white-noise  and  that  the  corresponding  covariance  operator  Else  ] is 
the  identity  as  in  (59),  the  mean-square  estimation  error  E(f  ,f  ) of  the 
process  error  becomes  unbounded.  This  unboundedness  can  be  ascertained  by 
inspection  of  the  right-hand  side  of  (76).  By  an  argument  similar  to  the 
one  used  to  analyze  the  summation  in  (23),  it  can  be  established  that  the 
term  1/(1+X,^)— 1 as  k-*«>.  This  implies  that  the  summation  in  (76)  does  not 
converge,  and,  in  fact,  becomes  unbounded. 

This  lack  of  convergence  is  due  to  the  fact  that  the  process-error 
estimation  error  f =f-f^  is  a spatially  distributed  process  that  for  large 
k retains  many  of  the  spatial  characteristics  of  the  white-noise  process  f. 
This  similarity  between  f and  f at  large  k can  follow  from  the  fact  that 
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f is  a 'smoothed'  estimate  in  the  sense  that  it  is  dominated  primarily  by 
the  spatial  characteristics  associated  with  small  values  of  k.  Since  is 
being  subtracted  from  f to  obtain  f^,  the  resulting  process  will 

tend  to  be  dominated  at  large  k by  the  spatial  characteristics  of  the 
initial  white-noise  process  f. 

Although  it  should  be  kept  in  mind  at  all  times,  the  problem  of 
unboundedness  of  E(f  ,f  ) is  not  as  serious  as  it  may  seem  initially,  since 
the  white-noise  assumption  used  for  f is  never  satisfied  in  practice.  In 
physical  situations,  the  actual  errors  are  never  white  and  the 
corresponding  model  error  estimates  f^  and  the  process  error  estimation 
error  f_  are  well-behaved.  The  white-noise  assumption  (59)  is  a 
mathematical  fiction  that  generally  is  useful  because  it  leads  to  a 
relatively  simple  'a  priori'  covariance  analysis  as  represented  by  (60)- 
(77).  However,  the  results  obtained  by  such  an  analysis  should  be  used 
only  if  they  make  sense  and  should  be  discarded  otherwise.  The 
unboundedness  of  E(fp,fp)  is  therefore  a property  that  is  primarily  of 
mathematical  interest  but  that  can  be  ignored  in  practice  because  the 
assumptions  under  which  the  lack  of  boundedness  occurs  are  seldom 
satisfied. 

The  State  and  Model-Error  Estimates  as  Conditional  Expectations 

The  aim  here  is  to  reinterpret  the  state  and  model-error  estimates  u 
and  in  (9)  and  (12)  as  the  conditional  expectations  e =E(e/y)  and 
u^=E(tt/y).  To  this  end,  consider  the  general  formula  13]  for  the 
conditional  expectation  of  u given  y: 


E(u/y)  = E(uy*)  EE(yy*) ]~^y. 


(78) 


expressed  in  terms  of  the  'cross-covariance'  operator  E[uy  ] and  the  'auto- 
covariance' operator  E[yy  ].  ^Direct  _^o^mputation  of  these  two  operators 
leads  to  E[uy^]=LL  H*  and  E[yy  ]=I+HLL  H . These  last  two  equations  imply 
that 

Uq  = E(u/y)  = LL*H*[I+HLL*H*]"V»  (79) 

an  equation  that  has  previously  been  recorded  as  (12),  but  that  can  now  be 
used  to  observe  the  equivalence  of  u^  and  E(u/y) . 

Similar  arguments  can  be  used  to  show  that 

E(e/y)=E(ey*)[E(yy*)]"^y,  (80) 

where  E(ey)*=E( 88*h*)=h*  and  E(yy*)=E(h88*h*)=hh*.  Consequently, 

8p=E(e/y)=h*(hh*)“^y,  (81) 
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an  equation  previously  recorded  as  (9)  but  that  now  makes  explicit  the 
equivalence  of  Sq  and  E(e/y). 

The  above  equivalence  has  been  established  using  a somewhat  indirect 
approach  where  the  deterministic  solutions  Bq  and  u^  in  (9)-(12)  were  first 
obtainc»d  using  a least-squares  formulation.  The  deterministic  solutions 
were  then  re-interpreted  probabilistically  as  the  conditional  expectations 
E(s/y)  and  E(u/y)  in  (79)  and  (81),  An  alternative  and  more  direct 
approach  to  establish  that  BQ=E(e/y)  and  Ujj=E(u/y)  would  be  to  introduce  a 
probabilistic  framework  with  u,  e and  y assmed  to  be  random-fields  from  the 
outset*  and  then  to  use  a direct  computation  of  the  conditional 
expectations  to  arrive  at  (79)  and  (81),  Such  an  approach  is  currently 
under  investigation  and  will  be  reported  on  elsewhere  by  the  author. 

8.  CONCLUDING  REMARKS 

This  paper  has  advanced  an  approach  to  the  concurrent  estimation  of 
the  state  and  of  the  model  errors  for  a system  described  by  elliptic 
equations.  The  approach  is  based  on  the  principle  of  least-squares  that 
seeks  to  find  the  model  error  vector  with  smallest  norm  subject  to  linear 
constraints  in  a suitably  defined  function  space.  Solution  of  the 
optimization  problem  leads  to  state  and  model  error  estimates  that  can  be 
computed  either  in  a batch-processing  mode  where  all  of  the  data  is 
processed  simultaneously  or  in  a scanning  mode  where  the  data  is  processed 
recursively.  A probabilistic  interpretation  of  the  deterministic  least- 
squares  solutions  makes  possible  an  'a  priori'  covariance  analysis  of  the 
estimation  error  associated  with  the  estimates.  The  covariance  analysis 
also  provides  formulas  to  evaluate  the  mean-square  estimation  error 
corresponding  to  both  the  state  and  the  model  error  estimates. 
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ABSTRACT 


This  paper  outlines  the  application  of  the  estimation  approaches  of 
Ref.  [1]  to  the  problem  of  static  shape  determination  for  large  antenna 
systems.  The  problem  consists  of  estimating  the  shape  of  an  antenna  sur- 
face from  measurements  of  its  static  deflection.  The  estimation  schemes 
are  based  on  any  one  of  the  modeling  options  of  a single  PDE  for  early 
insight  and  understanding,  coarse-resolution  multiple-PDE  models  for  para- 
metric studies  and  fine-resolution  piecewise-continuum  models  for  detailed 
design.  For  any  one  of  these  three  models,  estimator  design  can  be  devel- 
oped using  an  infinite-dimensional  approach,  where  the  necessary  finite- 
element  truncation  and  approximation  is  conducted  after  the  analytical 
design  has  taken  place,  or  it  can  be  based  on  a f inite-dimensional  approach, 
where  the  model  is  truncated  before  the  estimation  problem  is  formulated. 

One  of  the  main  objectives  of  the  paper  is  to  develop  both  approaches  while 
simultaneously  investigating  their  differences  and  similarities.  Simulation 
results  of  an  application  of  the  finite-dimensional  approach  to  a large 
parabolic  reflector  are  presented.  Similar  large  antenna  simulations  for 
the  infinite-dimensional  approach  are  currently  being  carried  out. 


1 . INTRODUCTION 


The  problem  of  static  shape  determination  consists  of  finding  the  shape 
of  an  antenna  surface  from  multi-point  spatially  distributed  measurements  of 
the  structural  deflections,  as  illustrated  in  Fig.  1(a).  A large  flex- 
ible antenna  will  be  perturbed  from  a nominal  shape  by  disturbances  due, 
for  example,  to  control  actuation  and  external  loads.  To  meet  future  an- 
tenna performance  requirements,  it  will  be  necessary  to  determine  these  de- 
flections quite  accurately,  and  in  some  cases,  to  apply  control  forces  and/ 
or  moments  to  restore  the  desired  antenna  shape.  The  estimation  methods 
described  here  would  constitute  a fundamental  element  of  such  a combined 
shape  determination  and  control  capability. 

Current  methods  for  antenna  calibration  entail  a painstaking  survey  of 
the  entire  surface  of  the  structure,  usually  accomplished  by  a technician 
armed  with  a theodolite  as  in  Fig.  1(b).  Once  the  survey  is  complete,  a 
least-squares  fit  of  the  data  to  the  parameters  of  a paraboloid  is  per- 
formed. The  entire  process  can  take  on  the  order  of  days  and  is  clearly 
inadequate  for  systems  requiring  real-time  autonomous  operation. 
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a)  PROBLEM 


b)  CURRENT  SOLUTION 


o)  FUTURE  SOLUTION 


TARGET 


ACTUAL 

SHAPE 


Fig.  1.  Static  Shape  Determination  Problem 

An  improved  solution  to  the  shape  determination  problem  is  provided  by 
an  integrated  system  in  Fig.  1(c),  currently  under  development  at  JPL,  that 
combines  the  two  mutually  complementary  technologies  of  electro-optical 
sensing  for  shape  determination  and  estimation/identification  methodology 
for  processing  of  the  sensor  data.  Sensing  is  provided  by  SHAPES  [2],  a 
sensor  capable  of  the  real-time,  simultaneous  measurement  of  the  three- 
dimensional  deflections  of  about  50  points  on  a structure  at  a sajjipling 
rate  of  10  hz.  While  Fig.  1(c)  illustrates  the  full  system  capability  for 
static  and  dynamic  shape  estimation  and  for  parameter  identification,  this 
paper  focuses  primarily  on  the  static  shape  estimation  algorithms  as 
described  below. 


2.  ELLIPTIC  MODELS  FOR  SHAPE  ESTIMATOR  DESIGN 


Central  to  the  shape  determination  schemes  under  development  here  is 
the  selection  of  a mathematical  model  to  represent  the  elastic  behavior  of 
the  structure.  The  following  types  of  models  (Fig.  2)  are  useful  at  three 
distinct  stages  of  the  design  process:  1)  single  PDE  models  defined  over  a 

single  domain  for  development  of  initial  physical  insight  and  understanding, 
2)  coarse-resolution  multiple-PDE  models  for  parametric  studies  and  design 
and  3)  fine-resolution  piecewise-continuum  models,  defined  over  a possibly 
fine  structural-element  grid,  for  detailed  design  and  evaluation. 
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a)  SINGLE  PDE 


b)  MULTIPLE  PDE 


c)  FINE  RESOLUTION 


Fig.  2.  Models  for  Estimator  Design 


Single  PDE;  Models 


Such  models  are  most  useful  in  the  early  stages  of  a design,  when  one 
of  the  main  objectives  is  to  create  physical  understanding  and  insight  about 
the  design  problem.  An  example  of  such  a model,  one  that  would  be  applicable 
as  a simple  approximation  to  a parabolic  antenna  system,  is  illustrated  in 
Fig.  2(a).  This  model  has  been  used  to  develop  the  scanning  solutions  for 
data--processing  outlined  in  [1].  The  model  consists  of  the  following  equa- 
tions for  the  out-of-plane  displacement  of  a membrane  in  tension: 


V • (TVu)'  = f. 


’2tt 


frd0dr  = 0, 


(1) 


y(0)  = u(R,  6)  -f  n(6),  for  0 < 0 < 2tt, 


(2) 


defined  over  a circle  of  radius  R in  terms  of  the  polar  coordinates  r and  9. 
The  symbol  T denotes  the  membrane  tension  for  unit  area.  The  free-boundary 
condition  (v  * V)u  = 0 is  assumed  at  the  outer  boundary,  where  v denotes  the 
corresponding  unit  normal.  The  "internal"  condition  u(O,0)  = 0 is 
assumed  to  hold  at  the  origin  r = 0 to  account  for  the  fact  that  the  sensor 
is  assumed  to  measure  the  relative  displacement  between  the  boundary  and  the 
origin.  The  observation  equation  (2),  in  this  particular  case,  assumes  that 
the  data  is  available  continuously  at  the  boundary  of  the  domain.  This 
assumption,  however,  is  not  a major  restriction,  as  other  types  of  observa- 
tions (such  as  discrete  measurements  along  the  boundary)  can  be  encompassed 
with  a siiailar  formulation  as  described  in  Ref.  [1]. 
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An  alternative  approach  to  describing  the  model,  which  will  be  useful 
later  in  applying  the  finite-element  method,  is  based  on  the  following  vari- 
ational formulation.  The  potential  energy  a(u,u)  associated  with  the 
model  is  a (u,  u)  = (1/2) T Vu  • Vu  dS2.  A variational  (weak)  solution  to 
(1)  is  obtained  by  solving  the  following  minimization  problem 


min  J (u)  = a(u,u)  - (u,f),  for  u in  V, 


(3) 


where  V,  commonly  referred  to  as  the  space  of  admissible  functions, 
consists  of  those  functions  which  possess  square-integrable  first-order 
spatial  derivatives  and  which  satisfy  the  geometric  boundary  conditions 
u(0,  6)  = 0 of  the  problem.  The  boundary  condition  (v  • Vu)  = 0 at  the 
circumference  of  the  domain  emerges  as  a natural  condition  by  solving  the 
minimization  problem  in  (3).  Ref.  [3]  contains  a more  precise  statement 
of  the  above  variational  formulation. 

Closely  related  to  the  minimization  problem  in  (3)  is  the  principle  of 
virtual  work  stating  that 


2 a(u,v)  = (v,f),  for  v in  V, 


(4) 


which  is  obtained  readily  from  (3)  as  a necessary  condition  for 
minimization. 

The  value  of  introducing  the  variational  model-statements  in  (3)  and 
(4)  is  that  they  can  be  readily  extended  to  describe  complicated  multlple- 
PDE  models.  While  In  the  simple  case  of  a single  equation,  it  is  equally 
convenient  to  write  either  the  explicit  form  (1)  or  the  variational  forms 
(3)  and  (4),  in  more  complicated  cases  described  below,  the  variational 
formulation  results  in  a somewhat  simpler  statement  of  the  model. 

Coarse-Resolutlon  Multlple-PDE  Models 


These  models  involve  first  subdividing  a structure  into  a few  major 
substructures,  describing  each  substructure  by  means  of  a simple  set  of  par- 
tial differential  equations,  and  then  specifying  appropriate  continuity  con- 
ditions at  the  interfaces  between  substructures.  The  models  that  result 
from  this  process  are  specified  in  terms  of  a few  coupled  partial  differen- 
tial equations  together  with  the  corresponding  boundary  and  interface 
conditions . 

An  example  of  such  a model  is  illustrated  in  Fig.  2(b).  The  structure 
consists  of  an  antenna  dish  with  a prescribed  number  N of  ribs  emerging 
from  a central  rigid  hub.  The  ribs  provide  the  necessary  tension  to  support 
a membrane-like  mesh.  In  the  interest  of  obtaining  a relatively  simple 
model,  the  antenna  dish  is  assumed  to  be  flat,  and  only  the  out-of-plane 
deflection  is  considered.  Additional  simplifying  assumptions  are  that  the 
mesh  can  be  described  by  the  equations  for  a membrane  in  tension  and  that 
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the.  deflection  of  each  rib  is  governed  by  a simple  beam-equation  model. 
The  out-of-plane  deflection  of  the  dish  is  described  by  the  composite  N- 
vector  u(r,6)  = [u]^  (r , 0) , . . . ,U]q  (r , 0)  ] , where  Uj^(r,0)  is  the  displace- 
ment of  the  dish  within  each  subdomain.  In  terms  of  this  displacement 
field,  the  total  potential  energy  can  be  expressed  as: 


a(u,u) 


T Vu(r,0) 


Vu(r , 0)  rd6dr 


(5) 


El [9^u(r,0^)/9r^ 


2 

dr, 


where  Vu  - [ Vu]. , . . . , Vu]^  ] , and  T and  El  are  respectively  the  membrane- 
tension  and  beam-stiffness  coefficients. 

The  variational  principles  in  (3)  and  (4)  above  apply  to  the  multiple 
PDE  models  once  the  potential  energy  (and  the  corresponding  space  V of  ad- 
missible functions)  has  been  specified.  For  the  example  in  Fig.  2(b),  the 
potential  energy  is  given  by  (5).  The  related  admissible  space  V consists 
of  those  displacements  that  are  sufficiently  differentiable  so  that  the 
potential  energy  (5)  is  well-defined  and  that,  in  addition, 
satisfy  the  following  geometric  boundary  and  interface  conditions.  The 
inner-boundary  geometric  conditions  u(0,6jj)  = 9u(0,8jq)/9r  = 0 for  the  rib 
elements  correspond  to  the  assumption  that  the  ribs  are  pinned  at  the  hub. 
For  the  mesh  elements,  the  inner-boundary  condition  u(0,6)  = 0 corresponds 
to  a similar  assumption.  The  natural  boundary  conditions  for  the  ribs 
92u (R,  6jq)  / 9r^  = B^u (R,  6n) /9r3  = 0 and  for  the  mesh  9u(R,0)/9r  = 0 for 
0 < 6 < 0N  emerge  from  the  variational  formulation. 

To  complete  a specification  of  the  admissible  displacements,  it  is 
necessary  to  determine  interface  conditions.  These  conditions,  based  on 
the  assumption  that  the  displacement  field  is  continuous  at  the  interfaces 
between  rib  and  mesh  elements,  can  be  stated  as  ui(r,9j^)  = Uj^^|(r,0)  for 
0 < r < R or  in  the  more  convenient  matrix  form 


u(r,0j^)  = C u(r,  0),  C 


0 10. 

0 0 10 


0 

0 


(6) 


0 0 ....  1 

_ 1 0 ....  0 _ 
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It  is  of  interest  to  note  that  C is  a periodic  matrix  with  all  of  its 
eigenvalues  equally  spaced  on  the  unit  circle,  a property  that  can  be  used 
very  efficiently  to  analyze  the  model  by  means  of  cyclic  symmetry  as  outlined 
in  Ref.  [4] . 

A more  explicit  form,  analogous  to  (1) , of  the  above  coarse-resolution 
multiple-PDE  model  can  be  obtained  by  a set  of  analytical  manipulations 
(closely  related  to  integration-by-parts).  Use  of  the  divergence  theorem 
and  of  the  geometric  boundary  conditions  (6)  in  (4)  results  in 

f(r,0)  - V • [T  Vu(r,6)]j-  • v(r,9)  rd9dr  + 


•9 

N 


•R 


T[9u(r,9)/8r]  • v(r,6)  rd9  + / El  [3^u(r, 0^)/9r^] 


0 


•R 


[3^v(r,ep/8r^]  dr  + j T r“^  [ 8u(r , 0^) /36  • v(r,  0^) 

0 


3u(r,0)/39  • v(r,0)]  dr  = 0, 


(7) 


which  must  be  valid  for  all  admissible  virtual  displacements  v.  This  con- 
dition, after  two  integrations  by  parts  in  the  third  term  and  use  of  the 
geometric  constraints  (6)  in  the  last  term,  results  in  the  following  more 
explicit  set  of  equations  describing  the  model: 


V • (T  Vu)  = f. 


(8) 


3r 


[El  (r,0j^)]  = 7 fe  f ^ u(r,0)  - u(r,6^)]  , 

3r 


(9) 


together  with  the 
9^(R,0N)/9r3  = 0. 
form  of  (1) , with 
Investigation. 


2 2 

natural  boundary  conditions  3u(R,0)/3r  = 3 u(R,0j^)/3r  = 

As  expected,  this  set  of  equations  are  in  the  general 
perhaps  the  exception  of  (9)  which  requires  further 
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Equation  (9)  is  an  interface  condition  reflecting  the  forces  that  the 
mesh  membrane  applies  at  each  of  the  beam  elements.  Some  physical  insight 
can  be  gained  by  transforming  (9)  into  the  following  scalar  notation: 


72  '"^72  =? 


•96  39 


(10) 


in  terms  of  the  scalar  displacements  u.  of  the  individual  rib  elements. 

Note  that  the  term  on  the  right  reflects  the  forces  that  the  adjacent  mem- 
brane elements  apply  on  the  i^'^  rib  element.  This  force  is  dependent  on 
the  difference  in  the  circumferential  slope  (the  partial  derivative  of  the 
displacement  with  respect  to  0)  of  the  membrane  deflection  at  the  interface. 

A different  way  to  interpret  this  equation  is  based  on  the  observation  that 
a rib  deflection,  as  represented  by  the  left  side  of  (10),  would  cause  a 
"jump"  discontinuity  in  the  circumferential  slope  of  the  membrane. 

The  foregoing  equations  represent  the  sought-after  multiple-PDE  model 
for  the  structure.  While  the  model  as  developed  here  involves  only  out-of- 
plane deflections,  a similar  approach  can  also  be  used  for  the  in-plane 
deflections  as  described  in  more  detail  in  Ref.  [4]. 

Some  remarks  are  appropriate  about  the  potential  uses  and  limitations 
of  such  coarse-resolution  models.  Because  of  their  relative  simplicity,  the 
models  are  usually  a very  coarse  approximation  to  the  actual  structure.  For 
instance,  the  dish  has  been  assumed  to  be  flat,  whereas,  in  more  complete 
models,  the  parabolic  dish  geometry  has  to  be  taken  into  account.  Also, 
the  ribs  have  been  modeled  as  simple  beam  elements,  while,  in  reality,  each 
rib  is  in  itself  a complicated  structure  (of  variable  cross-section  and 
curved  geometry)  that  would  require  a more  complete  model  for  its  full 
description.  Because  of  these  and  other  limitations  (such  as  the  use  of  a 
linear  elastic  model  to  approximate  the  actual  nonlinear  mesh  behavior) , it 
is  necessary  to  use  caution  in  applying  these  models. 

Most  of  the  applications  of  the  piecewlse-continuum  models  result  from 
their  ability  to  describe  many  of  the  basic  features  of  the  actual  structure, 
while  still  retaining  a relative  simplicity.  Since  the  models  typically  in- 
volve only  a few  parameters,  they  can  be  quite  useful  in  conducting  para- 
metric design  studies.  For  example,  they  can  be  used  to  establish  the 
sensitivity  of  estimator  performance  to  changes  in  the  structural  stiffness 
properties.  For  the  same  reasons,  the  models  can  also  be  used  in  non-modal 
parameter-identification  approaches  to  estimate  such  parameters  as  the 
average  mesh  tension  T that,  in  a global  somewhat  approximate  sense,  char- 
acterize the  stiffness  properties  of  the  structure. 

In  summary,  while  the  coarse-resolution  multiple  PDE  models  can  be  use- 
ful in  many  cases,  their  inherent  limitations  must  be  understood  in  order 
for  the  models  to  be  applied  correctly.  In  cases  where  more  reso- 
lution is  required,  the  models  described  below  have  a better  potential  (by 
no  means  always  achieved)  for  characterizing  accurately  the  behavior  of 
complicated  structures. 
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Fine-Resolution  Multiple  PDE  Models 


Development  of  these  models  begins  by  first  partitioning  a structure 
into  a possibly  large  number  of  elements,  as  in  the  finite-element  approach 
to  structural  modeling.  After  this  element  grid  has  been  specified,  the 
deflection  within  each  of  its  elements  is  described  by  the  relatively  simple 
PDF's  used  in  the  traditional  elements  (beams,  plates,  membranes,  etc.) 
in  structural  analysis.  To  complete  the  model,  interface  conditions  re- 
flecting continuity  of  the  displacement  field  and  possibly  some  of  its  spa- 
tial derivatives  are  then  specified.  An  illustration  of  the  grid  for  repre- 
senting an  antenna-reflector  model  is  contained  in  Fig.  2(c). 

A fine-resolution  piecewise-continuum  model  would  be  obtained  as  fol- 
lows. The  structural  deflection  within  each  element  would,  as  in  the  previ- 
ous model,  be  described  by  the  symbol  U£(x^),  where  are  the  local  spatial 
coordinates  within  each  grid  element  fip.  The  total  potential  energy  of  the 
system  can  be  expressed  as: 


a(u,u) 


[ D.u. 
1 1 


D.u. ] dfi. , 
11  1 


(11) 


where  D^  are  appropriately  defined  partial  differential  operators,  and  c^ 
are  constants  characterizing  the  elastic  properties  of  each  element.  Note 
that  while  the  local  element  stiffness  has  been  characterized  by  a single 
scalar  (for  the  sake  of  simplicity),  the  formulation  can  be  extended  readily 
to  cases  where  the  stiffness  properties  are  specified  by  several  scalars. 

This  definition  of  the  potential  energy  is  the  key  step  required  to 
apply  either  one  of  the  variational  principles  in  (3)  and  (4)  to  the  fine- 
resolution  PDE  models.  The  corresponding  space  V of  admissible  displace- 
ments is,  as  before,  based  on  the  peculiarities  of  the  problem  under  inves- 
tigation. In  general  terms,  however,  the  displacements  in  this  space  are 
selected  so  that  the  potential  energy  makes  sense  and  so  that  the  geometric 
boundary  conditions  (and  also  the  inter-element  conditions)  are  satisfied. 

Fine-Resolution  Matrix  Differential  Operator  Models 


A more  explicit  matrix-differential-operator  model,  similar  to  that  in 
Eq.  (8),  can  be  developed  for  complicated  structures  requiring  a fine- 
resolution  grid  description  as  in  Fig.  2(c).  To  this  end,  consider  the 
identity 


[D.u.  . D.u.  ] dfi. 

IX  11  1 


[u.  -B.u.  ] dr.  - / [u.  • D'.‘c.D.u.]dS^.  , (12) 

111  1 I 11111  1 
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where  Bj^  denotes  a matrix  differential  operator  defined  over  the  boundary 
of  and  d|  is  for  formal  adjoint  of  Dj[,  For  specific  cases  of  prac- 
tical interest,  this  identity  can  be  obtained  readily  by  a process  analo- 
gous to  integration-by-parts,  as  in  the  case  of  Eq.  (8),  Use  of  (12)  in 
(11)  implies  that  Au  = f,  where  u is  the  composite  vector  u = [u]^, . , . ,unJ  , 
and  A is  a matrix-differential  operator  whose  diagonal  elements  are  DiC^Dj^ 
in  (12).  In  some  cases,  it  is  convenient  to  replace  this  model  Au  = f with 
the  more  general  model 


Au  = Bf , 


(13) 


where  the  operator  B has  been  added  to  reflect  the  fact  that  the  forcing 
term  f = [ f , . . . , f may  not  be  applied  at  all  elements 

The  operator  A in  (13)  may  appropriately  be  referred  to  as  the  "stiff- 
ness operator"  because  it  is  a generalization  of  the  traditional  stiffness 
matrix  of  structural  mechanics.  This  equation  must  be  satisfied  together 
with  a set  of  inter-element  conditions  (emerging  as  natural  conditions  from 
the  first  term  on  the  right  side  of  (12)),  which  reflect  the  continuity  of 
the  quantity  B^u^  at  the  Interelement  boundaries  T^. 

Discussion 


The  piecewise-continuum  models  just  described  simultaneously  retain 
both  the  conceptual  completeness  of  partial  differential  equations  and  the 
versatility  of  the  finite-element  approach  to  modeling  of  complicated  struc- 
tures. Retention  of  the  infinite-dimensionality  of  the  continuum  models 
implies  that  the  model  has  the  potential  for  describing  many  of  the  impor- 
tant physical  characteristics  (which  may  be  important  to  estimation  accur- 
acy) of  the  structure.  For  example,  in  the  antenna  shape  determination 
problem,  it  is  necessary  to  have  an  accurate  description  of  the  antenna  de- 
flection inside  each  grid  element  in  order  for  the  shape  estimates  to  be 
consistent  with  the  relatively  short  wavelength  (4  cm  for  a typical  electri- 
cal operating  frequency  of  2.5  Ghz)  of  intended  operations.  The  ability  to 
retain  such  resolution  makes  the  model  potentially  more  accurate,  although 
finer  resolution  does  not  necessarily  imply  improved  accuracy.  It  is  quite 
possible  that  the  additional  information  being  retained  by  the  fine- 
resolution  model  is  corrupted  by  inevitable  model  errors.  However,  in  spite 
of  these  errors,  the  models  can  still  be  quite  useful  if  they  retain  quali- 
tatively the  significant  effects  of  the  structure  response. 

One  of  the  main  objectives  of  developing  such  models  of  course  is  to 
conduct  the  shape  estimator  designs.  As  will  be  discussed  in  Sec.  3,  this 
design  can  be  based  on  the  abstract-operator  model  in  (13),  without  having 
to  convert  this  model  into  scalar  PDE  notation.  Obtaining  such  a scalar 
model  would  be  impossible  for  structures  of  even  reasonable  complexity,  be- 
cause of  the  large  number  of  coupled  differential  equations,  inter-element 
boundary  conditions  and  coordinate  transformations  which  would  be  Involved. 

However,  before  introducing  the  design  approach  based  on  the  operator 
models  (to  be  done  in  Sec.  3),  it  is  of  interest  to  develop  first  a finite- 
dimensional version  of  (13)  as  described  below. 
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Finite-Dimensional  Models 


An  alternative  to  the  above  infinite-dimensional  models  is  obtained  by 
solving  approximately  the  minimization  problem  (3)  by  means  of  the  finite- 
element  method  [5].  The  finite-element  solution  is  obtained  by  solving  (3) 
within  an  approximation  subspace  typically  consisting  of  piecewise  poly- 
nomial functions  defined  over  a finite-element  structural  grid.  Every 
function  u within  this  subspace  can  be  expressed  as 


u = 


(14) 


where  (|)k  is  a piecewise-polynomial  basis,  and  x are  the  corresponding  co- 
efficients. Substitution  of  (14)  in  (3)  and  performance  of  the  indicated 
minimization  results  in 


KX  = F, 


(15) 


where  K is  a stiffness  matrix  whose  general  element  K = 2a  (cj)]^,  (|)jjj) ; F is 
a vector  whose  elements  Fj^^  = (<|)]^,f),  and  X is  a vector  formed  by  the  coef- 
ficients x^  in  (14). 

Orientation 


One  of  the  central  themes  of  this  paper  is  to  understand  the  relation- 
ship between  the  following  two  approaches  to  estimator  design: 

• an  approach  where  an  infinite-dimensional  model  is  retained  as 
long  as  possible  throughout  the  design  process  while  delaying 
the  necessary  truncation  until  the  analytical  design  has  taken 
place . 

• a finite-dimensional  design  approach  where  the  structural  model 
is  first  truncated  and  the  subsequent  estimator  design  is  based 
on  the  approximate  model. 

Although  in  the  first  approach  the  design  is  obtained  first  and  then  trun- 
cated, in  the  second  approach  these  two  steps  of  design  and  approximation 
are  reversed.  A development  of  the  two  approaches,  together  with  illustra- 
tive examples  and  applications,  is  contained  in  the  next  three  sections  of 
the  paper. 
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3.  INFINITE-DIMENSIONAL  APPROACH  TO  ESTIMATION 


The  infinite-dimensional  approach  to  estimator  design  begins  from  the 
basic  goal  of  minimizing  the  following  quadratic  functional  of  the  model 
errors : 


min  J(f)  = (y  - Hu,y  - Hu)^  + (f,f)2» 

(16) 

where  the  inner  product  notation  is  defined  in  Ref.  [1].  The  optimization 
problem  must  be  solved  subject  to  the  constraints  specified  by  the  model 
equations  Au  = Bf  in  (13),  emerging  from  a single  PDE,  a coarse-resolution 
multiple  PDE  model,  or  a fine-resolution  plecewlse-continuum  model.  These 
constraint  equations  must  be  satisfied  together  with  interelement  condi- 
tions, similar  to  (9)  above,  reflecting  continuity  of  the  displacements 
and  possibly  a finite  number  of  its  spatial  derivatives.  The  observation 
equation  is  again  specified  by  y = Hu  + n. 

Application  of  a calculus  of  variations  approach  [3] 
lowing  boundary-value  problem 

leads  to  the  fol- 

Au  = BB*X, 
o 

(17) 

A*A  = H*(y  - Hu  ), 
o 

(18) 

whose  solution  is  specified  in  terms  of  the  optimal  estimates  u and  the 
adjoint  variables  X.  The  corresponding  model  error  estimates  n^ 
are  provided  by 

f = 
o 

(19) 

n = y - Hu  , 

O 0 

(20) 

in  terms  of  the  previously  determined  variables  and  X. 

The  above  is  a boundary-value  problem  of  the  type  traditionally  en- 
countered [3]  in  solving  optimal  control  and  estimation  problems  with  quad- 
ratic criteria.  Its  solution  leads  to  the  state  and  model  error  estimates 
Uq  and  = [f^jn^].  This  boundary-value  problem  provides  an  equivalent 
alternative  to  the  integral  equation  approach  of  Ref.  [1]  for  computation 
of  the  optimal  estimates. 
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Equivalence  of  Boundary-Value  Problem  and  Fredholm  Integral  Equation 


To  show  the  equivalence,  first  solve  (18)  for  X to  obtain  X = L*H*nQ, 
where  L*  is  the  integral  operator  whose  kernel  is  the  Green's  function  for 
A*.  Substitution  of  this  equation  in  (17)  implies  that  Uq  = LBB*L*H*nQ, 
and  use  of  this  in  the  observation  equation  y = Huq  + nQ  leads  to 


(I  -I-  HLBB*L*H*)n  = y, 
o 


(21) 


which,  upon  setting  B = I,  can  be  recognized  as  the  integral  equation  in 

[1]. 

An  Alternative  Form  for  the  Boundary-Value  Problem 

An  alternative  form  of  the  above  boundary-value  problem  is  obtained, 
in  cases  where  B = I,  by  eliminating  the  adjoint  variables  X in  (17)  and 
(18).  First,  operate  on  (17)  by  A*  to  obtain  the  intermediate  equation 
A*AUq  = A*X.  Then,  use  this  in  (18)  to  arrive  at  (A*A  + H*H)uq  = H*y, 
which  implies  that 


u 

o 


(A*A  + H*H) 


H*y. 


(22) 


It  is  of  interest  to  explore  the  relationship  between  this  state- 
estimate  solution  and  that  obtained  in  [1],  Recall  that  in  [1]  u has  been 
specified  as 


u^  = LL*H*(I  + HLL*H*)  ^ y. 


(23) 


Since  both  (22)  and  (23)  determine  Uq  in  terms  of  the  data  y,  these 
equations  together  imply  the  interesting  identity: 


(A*A  + H*H) 


H*  = LL*H*(I  + HLL*H*) 


(24) 


Solution  of  Boundary-Value  Problem  by  Finite-Element  Method 

The  finite-element  method  [6]  is  applied  here  to  obtain  an  approximate 
solution  to  the  boundary-value  problem  in  (17)  and  (18)  that  characterizes 
the  optimal  estimates.  While  the  general  methodology  of  finite-element 
analysis  can  be  applied  in  a number  of  ways  to  this  problem,  an  approach 
involving  the  consecutive  solution  of  (18)  and  (17)  is  perhaps  the  most 
appropriate . 
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Consider  first  the  following  minimization  problem  whose  solution 
corresponds  to  that  of  (18) : 


min  J(A)=a(A,X)-(HX,n),  (25) 

° 3 

where  a (A, A)  is  the  potential  energy.  To  obtain  an  exact  variational 
(weak)  solution  to  this  problem  requires  that  the  minimization  in  (25)  be 
conducted  in  a space  V of  admissible  functions.  A finite-element  approxi- 
mate solution,  however,  is  obtained  by  conducting  the  minimization  in  an 
approximation  subspace  V^.  The  functions  in  can  be  expressed  as 


0 


A = 


(26) 


where  cf)j^  form  a piecewise  polynomial  basis  in  V]_,  and  Aj^  are  the  corres- 
ponding coefficients.  Substitution  of  (26)  in  (25)  leads  to 


A = R, ^ hf  n , (27) 

1 1 o 

1 N 

where  A = [A  , ...,  A ] is  the  vector  of  nodal-point  adjoint  variables, 
and  , , . . , is  a "modified"  observation  matrix.  A stiffness 

matrix  with  a general  element  R^™  = 2a(c())^,  <t)in)  > has  been  assembled  using 
the  approximation  subspace  V^. 

This  equation  determines  the  values  of  the  adjoint  variables  at  the 
nodal  coordinates,  whereas  the  closely  related  Eq.  (26)  provides  the  neces- 
sary interpolation  inside  the  elements  themselves.  Consequently,  (26)  and 
(27)  together  provide  the  sought-after  solution  of  (18),  and  there  remains 
only  the  task  of  solving  (17)  in  order  to  compute  the  state  estimates  u^. 

The  variational  form  of  (17)  is 


min  J(u^)  = a(u^,u^)  - (Bu^,BA),  (28) 

N 

where  the  minimization  takes  place  in  an  approximation  subspace  V2  that,  in 
general,  may  be  distinctly  different  from  the  subspace  used  to  solve 
(25).  The  functions  in  can  be  expressed  as 


u 

o 


k 


(29) 
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where  is  a piecewise  polynomial  basis  in  V2.  and  x are  the  corresponding 
coefficients  which  form  the  composite  nodal-point  state  estimates  X = 
[x^,,..,x^].  Substitution  of  (29)  in  (28)  results  in  ° 


X 

o 


(30) 


where  the  stiffness  matrix  K2  and  the  related  matrix  C2  are  specified  in 
terms  of  their  general  elements  = 2a((j;].,  and  = (BtJj]j^,B(j)j^) . 

Eqs.  (27)  and  (30),  when  used  in  the  observation  equation 
y = Hu^  + n^  = H2X^  + n^,  imply 

(I  + H2K~^C2K“^H^)nQ  = y,  (31) 

where  H2  = , . . . , This  equation  determines  the  observation  esti- 

mates no  from  the  data  y,  while  the  related  nodal  coordinate  state  esti- 
mates are  determined  from  (27)  and  (30).  The  interpolation  formulas  (26) 
and  (29)  provide  the  value  of  the  estimates  in  that  part  of  the  spatial  do- 
main within  the  individual  finite  elements. 

Illustrative  Example 


Consider  the  following  example  illustrated  in  Fig.  3: 


d^u(?)/dC^  = f(?),  y^^  = u(5j^)  + n^^, 


(32) 


with  the  boundary  conditions  u(0)  = u()l)  = 0,  where  is  the  length  of  the 
structure.  The  objective  of  the  estimation  process  is  to  minimize  the  func- 
tional 


J(f)  = cf2 


M 


k=l 


(33) 


which  is  a special  case  of  (16)  above.  Note  that  for  completeness  a weight- 
ing coefficient  a has  been  introduced  that  can  be  varied  as  a parameter  in 
the  estimator  design. 

The  boundary-value  problem,  analogous  to  (17)  and  (18),  whose  solution 
determines  the  optimal  estimates  is 

d^u^(5)/d?^  = A,  (33) 

d^A(0/d?^  = 0, 


(34) 


(35) 


dA(C^)/d5  - dX(C^)/d5  = . 

with  the  boundary  conditions  u(0)  = u(£)  = A(0)  = A(J,)  = 0.  The  symbols 
dA(5^)/d?  and  dX(^j^)/d5  denote  the  upper  and  lower  limits  of  the  slope 
dA/d|  at  the  measurement  location 


MODEL  dVdC^  = f 
OBSERVATION  y|^=u($l^)  + 


Fig.  3.  Example  to  Illustrate  Estimation  Approaches 


To  obtain  the  solution  of  this  problem  by  means  of  the  finite-element 
method,  it  is  necessary  to  first  convert  the  problem  statement  to  variational 
form  as  in  (25).  Consider  the  following  minimization  problem 


min  J(A)  = a (A, A) 


M 


k=l 


a(A,A) 


[dA/dCl^  d5, 


(36) 


(31) 
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where  X = denotes  the  value  of  the  adjoint  variable  A at  the 

measurement  location  and  = [?k+l>Sk]  ^ subinterval  whose  length 

is  the  distance  between  measurement  points.  To  obtain  an  exact  solution, 
the  minimization  in  (36)  is  to  be  conducted  in  the  space  Vj^  of  continuous 
functions  whose  derivative  is  piecewise  continuous.  While  for  this  simple 
example  it  is  possible  to  obtain  the  exact  solution  to  (36) , the  aim  here 
(in  order  to  illustrate  an  approach  that  applies  to  cases  where  the  exact 
solution  cannot  be  obtained)  is  to  instead  obtain  an  approximate  solution 
based  on  the  finite-element  method. 

Define  first  a finite-element  grid  (see  Fig.  3)  with  the  nodal  points 
[5i,...,5n]>  where  N is  the  total  number  of  nodes.  For  convenience,  the 
finite-element  partitions  are  selected  so  that  M _<  N and  the  set  of  nodal 
points  contains  the  set  of  measurement  points.  The  finite-element  method 
solves  (36)  within  an  approximation  subspace  consisting  of  piecewise 
linear  functions  A with  continuity  of  the  function  at  the  nodal  points. 

The  assumption  that  X is  piecewise  linear  implies  that 

A(?)  = A^  + (5  - 5.)  U - A )/A,  ?-,•<?<  ?,•  .i>  (38) 

where  A^  are  the  nodal-point  values  of  the  adjoint  variable  X.  Substitu- 
tion of  (38)  in  (36)  results  in  a matrix  equation,  similar  to  (27)  above, 
with  a N-by-N  stiffness  matrix  K]^  given  explicitly  by 


(39) 


where  the  subscript  1 denotes  that  the  displacement  function  used  in  form- 
ing Ki  belongs  to  the  approximation  subspace  V^.  It  is  necessary  to  be 
that  precise  about  the  approximation  subspace  used  to  form  this  stiffness 
matrix  because  in  subsequent  discussions  a different  stiffness  matrix  will 
be  formed  for  this  example  that  is  based  on  piecewise-cubic  interpolations, 
as  opposed  to  the  piecewise-linear  functions  used  above. 


590 


Specification  of  the  stiffness  matrix  is  the  key  ingredient  required 
to  carry  out  the  computations  in  (27)  that  determine  the  adjoint  variables 
A in  terms  of  the  observation  error  estimates  hp.  The  objective  now  is  to 
determine  the  state  estimates  u^  in  terms  of  the  adjoint  variable  X,  a goal 
which  is  accomplished  by  a finite-element  solution  of  (17)  above.  Consi- 
der the  variational  problem  equivalent  to  (17): 


min  J(u  ) = a(u  ,u  ) - (u  ,X),  (40) 

o o o o 

where  the  potential  energy  is  specified  in  (37).  While  (40)  is  apparently 
very  similar  to  (36)  , an  important  difference  between  the  two  problems 
is  that  the  last  term  in  (40)  involves  a function  X which  has  already,  in 
(38),  been  determined  to  be  piecewise-linear , whereas  the  last  term  in  (36) 
involves  only  values  of  this  function  at  the  measurement  locations.  Because 
of  this,  accurate  solution  of  (40)  requires  use  of  an  approximation  sub- 
space that  is  more  refined  than  that  used  to  solve  (36). 

A convenient  approximation  subspace  V2  consists  of  the  Hermlte 
piecewise-cubic  polynomials  [5]  which  possess  continuity  of  the  function 
and  its  first  derivative  at  the  nodal  points.  The  functions  in  this 
approximation  subspace  can  be  expressed  as 

Uq(5)  = x^(l  - 35^  + 25^)  + A0^  (1  - 2^  + 5^)5 

+ x^"^^  (3  - 25)5^  + (5  - 1)A^,  (41) 


where  x^  = u and  6^  = du  (J;.)/d5  are  respectively  the  value  of  the 

function  and  its  slope  at  the^noJal  points  Use  of  (41)  and  of  the 

piecewise-linear  approximation  (38)  in  (40)  results  in  an  equation  similar 
to  (30)  with  the  matrices  and  in  (30)  specified  by 
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The  vector  X in  (30)  for  this  example  is  = [x  ,9l,...,x  ,0  ] 
consisting  o?  the  nodal  deflections  x^  and  the  corresponding  nodal  slopes 
0^.  The  vector  A on  the  right  side  of  (40)  consists  of  the  previously 
determined  values  of  the  adjoint  variables  at  the  nodal  points. 

The  solution  for  the  nodal-point  state  estimates  X^  then  proceeds  as 
follows.  Eqs.  (39)  and  (42)  specify  the  necessary  ingredients  to  form  the 
matrix  (I  + H2K2^C2Kp“^Hj)  in  (31) , from  which  the  nodal-point  observation 
error  estimate  n^  is  evaluated  in  terms  of  the  data  y.  Once  n^  is  determined, 
Eq.  (30)  provides  the  desired  nodal-point  state  estimate  Xq. 

Simulation  Results  for  Illustrative  Example 


Assume  that  the  model  in  Fig.  3 is  being  perturbed  from  its  nominal 
shape  by  a force  of  unit  amplitude.  The  actual  deflection,  illustrated 
in  Fig.  4(a),  due  to  this  forcing  function  is  described  by  the  formula 
u(C)  = (l/8)£^  - (1/2) [5  - (1/2)^]^.  A total  of  three  equally  spaced  measure- 
ments at  the  locations  (l/4)£-,  (1/2)^,  and  (3/4)Jl.  produce  the  data  yi  = y3  = 
3£^/32  and  y2  = i^/8  to  be  used  for  estimation. 

Fig.  4(a)  shows  a comparison  between  the  actual  deflection  and  the 
related  estimates  for  three  distinct  values  of  the  coefficients  a in  (33) . 

Note  that  as  the  weighting  on  the  observation  error  is  increased  (by  select- 
ing larger  values  of  a) , estimates  that  are  closer  to  the  actual  deflection 
result.  The  reason  for  this  improvement  is  that  the  forcing  function  of 
f = -1  represents  a relatively  large  process  error,  whereas  the  observation 
error  has  been  assumed  to  be  zero.  Consequently,  estimator  designs  based 
on  a large  weighting  for  the  observation  error  are  more  consistent  with 
the  relatively  high  degree  of  confidence  that  can  be  placed  on  the  measure- 
ments. Note  also  that  because  of  the  piecewise-cubic  interpolation  in 
Eq.  (41)  the  estimated  displacement  can,  by  appropriate  selection  of  a, 
be  made  quite  close  to  the  actual  displacement. 

Fig.  4(b)  shows  a comparison  between  the  actual  process  error  f and 
the  corresponding  estimate  f^  for  the  same  three  distinct  values  of  the 
weighting  coefficient  a.  The  actual  error  f is  a force  of  unit  magnitude, 
whereas  f^  = A is  an  estimate  based  on  the  piecewise-linear  approximation 
(38) . Note  that  the  best  process-error  estimate  is  achieved  for  the  case 
where  the  weighting  coefficient  in  (33)  is  large,  so  that  the  estimates 
are  more  heavily  dependent  on  the  data  than  they  are  on  the  model  equations 
(32).  Note  also  that  close  to  the  ends  of  the  domain,  the  boundary  condi- 
tions A(0)  = A(£)  = 0 cause  the  process-error  estimate  fo  = A to  differ 
substantially  in  a point-wise  sense  from  the  actual  process  error.  This 
discrepancy  at  the  boundaries  can  be  easily  reduced  by  use  of  additional 
measurements  close  to  the  edges  of  the  domain. 
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Fig.  4.  Comparison  Between  Actual  and  Estimated  Deflections  and  Forces 


Discussion 


While  for  the  simple  example  above,  solution  of  (17)  and  (18)  by 
means  of  the  finite-element  method  can  be  readily  achieved,  obtaining 
similar  solutions  for  structures  of  even  relatively  moderate  complexity 
is  a nontrivial  undertaking.  A key  challenge  is  that  there  is  currently 
no  widely  available  automated  means  to  assemble  the  matrices  K2  and  C2  in 
(30),  for  the  approximation  subspaces  that  are  specifically  tailored  to  the 
solution  of  the  boundary- value  problem  (17)  and  (18) . Current  finite- 
element  codes  assemble  a stiffness  matrix  based  on  an  approximation  subspace 
that  is  consistent  with  obtaining  accurate  eigenvalues  and  eigenfunctions. 

The  approximations  used  to  compute  eigenvalues,  however,  may  or  may  not  be 
appropriate  for  the  estimation  (and  the  control)  problem.  The  appropriate 
choices  for  approximation  subspaces  must  depend  on  the  properties  of  the  solu- 
tion as  outlined  below. 
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Examine  the  boundary  value  problem  derived  from  (17)  and  (18)  with 
B = I; 


Let  Uj^  and  be  the  solutions  of  the  corresponding  systems 


where,  as  in  the  illustrative  example, 

H*e^  = <5(x-5j^). 

Then,  by  the  principle  of  superposition  the  solution  of  (42)  can  be 
expressed  as 


(42) 


(43) 


(44) 


" "Z“kV  ('‘5) 

k k 

where  the  coefficients  are  determined  by  the  system  (21) . 

From  (43),  (44)  and  (45),  it  follows  that  the  solution  to  the  estimation 
problem  is  a superposition  of  fundamental  solutions  (Green  functions) 
derived  by  considering  the  response  to  an  impulse  force  at  each  of  the 
measurement  points.  Thus,  for  the  antenna  models  considered  here,  loga- 
rithmic singularities  will  be  present  in  the  impulse  forces  if  measure- 
ments are  to  be  taken  on  the  mesh.  Standard  modal  approximations  cannot 
adequately  cope  with  this  difficulty  when  it  is  a physically  significant 
feature  of  the  estimation  problem. 

The  infinite-dimensional  approach  provides  the  framework  by  which 
finite-element  approximations  that  are  consistent  with  good  estimation  and 
control  accuracy  can  be  systematically  specified.  In  spite  of  its  poten- 
tial, however,  application  of  this  approach  to  complicated  structures  is 
not  readily  achieved  now  and  may  only  be  possible  in  the  future  if  software 
that  automates  the  required  computations  becomes  available. 
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4.  ESTIMATOR  DESIGNS  BASED  ON  FINITE-DIMENSIONAL  MODELS 


Since  in  this  approach  the  system  model  is  first  truncated  to  a 
finite  number  of  dimensions,  the  least-squares  method  for  estimation  has 
to  be  modified  somewhat  in  order  to  apply  to  the  finite-dimensional  model 
(15) . The  key  modification  involves  the  least-squares  criterion  (16) , 
which  for  finite-dimensional  models  becomes  an  ordinary  function  (as 
opposed  to  a functional  for  the  infinite-dimensional  problem) . The  least- 
squares  function  applicable  to  the  finite-dimensional  model  (15)  is 

min  J(F)  = (Y  - HX)"^  (Y  - HX)  + f'^F,  (46) 


where  F and  X are  constrained  by  (15).  The  finite-dimensional  approach  to 
estimator  design  is  based  on  minimizing  (46)  subject  to  these  constraints. 
The  observation  equation  corresponding  to  this  model  is  assumed  to  be 
Y =•  HX  + N,  where  as  before  Y is  the  data,  N is  the  observation  error,  and  H 
is  the  observation  matrix. 

Application  of  a variational  approach  to  (46)  leads  to  the  following 

solutions  for  the  state  and  model  error  estimates  X^  and  Eo  = [F  ,N  ] : 

° ° o o 

oT  -9T-1  T T-1 

Xq  = K“2  h [I  + HK  H ] Y,  = h (hh  ) Y,  (47) 


where  h = 


[HK  ^ 


I I],  or  in  more  explicit  notation. 


F 

o 


-IT  -2  T -1 

K H [I  + HK  H ] Y, 


-2  T-1 

N = [I  + HK  H ] Y . 
o ^ 


(48) 


These  equations,  involving  finite-dimensional  matrices,  are  analogous 
to  the  linear  operator  formulas  presented  in  Ref.  [1]  for  the  infinite- 
dimensional problem.  For  Instance,  Eq.  (10)  in  Ref.  [1]  states  that 
the  observation  error  estimate  n^  is  specified  by  n = (I  HLL*H*)y^  in 
terms  of  the  data  y.  The  similarity  between  this  equation  and  (48)  can  be 
ascertained  by  inspection.  Similar  analogies  can  be  drawn  between  (47)  - 
(48)  and  the  infinite-dimensional  solutions.  The  main  difference  between 
the  two  sets  of  equations  is  that  (47)  and  (50)  involve  matrices,  whereas 
[1]  contains  their  infinite-dimensional  counterparts. 


These  are  also  close  similarities  between  Eqs.  (47)  and  (48)  specifying 
the  estimates  based  on  a finite-dimensional  model  and  (27)  , (30)  and  (31) 
which  apply  to  the  infinite-dimensional  approach.  The  main  difference 
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between  the  two  sets  of  equations  is  that  the  stiffness  matrices  in  (31) 
are  formed  using  an  approximation  subspace  that  is  consistent  with  the 
estimation  problem,  whereas  the  stiffness  matrix  in  (47)  and  (48)  was  deter- 
mined prior  to  the  problem  formulation. 

Adjoint  Variable  Approach 

An  alternative  method  for  solving  the  constrained  minimization  problem 
in  (46)  involves  adjoining  the  constraints  (15)  to  (46)  by  means  of  a set 
of  adjoint  variables  A to  obtain 


min  J(F,X,A)  = (Y  - HX)”^  (Y  - HX)  + f’^F  + a”^(KX  - F).  (49) 

Use  of  the  adjoint  variables  converts  the  original  constrained  minimization 
problem  to  one  Involving  no  constraints.  A set  of  conditions  for  opti- 
mality for  this  problem  is 


KX  = A,  (50) 

o 

KA  = h'^(Y  - HX^),  (51) 


whose  solution  results  in  the  nodal-point  state  estimate  X^.  The  related 
nodal-point  model  error  estimates  are  specified  by  F^  = A and  = Y - HX. 

To  show  equivalence  of  (50)  and  (51)  with  (47)  first  solve  (51)  for 
A in  terms  of  N as  A = K“^h'^  N^.  Substitution  of  this  equation  in  (50) 
leads  to  Xq  = K-2hT  N^.  Since  Y = HXq  + Nq,  then  [I  + HK“2hT]No  = Y , 
an  equation  previously  recorded  in  (48). 

If  a different  set  of  manipulations  is  applied  to  (50)  and  (51),  a 
distinctly  different  set  of  equations  results.  Multiplication  of  (50)  by 
K and  substitution  in  (51)  imply  that 

(K^  + h\)X  = h'^Y,  (52) 

o 


an  equation  that  can  be  used  as  an  alternative  to  (47)  in  solving  for  the 
nodal-point  state  estimates  X^  in  terms  of  the  data  Y.  Solution  of  (52) 
requires,  however,  inversion  of  the  matrix  (r2  + h'^H)  whose  dimensions 
are  equal  to  the  total  number  of  degrees  of  freedom  in  the  finite-element 
model.  The  Inversion  of  (I  + HK~^h'^)  required  by  (47)  is  typically  easier 
to  carry  out  because  this  matrix  has  dimensions  equal  to  the  number  of 
data  points,  and  this  number  is  usually  smaller  than  the  number  of  nodes. 
Eq.  (47),  on  the  other  hand,  is  not  totally  free  from  inversion  of  matrices 
of  large  dimension  because  of  the  need  to  Invert  the  stiffness  matrix  K in 
forming  the  matrix  (I  + HK~2h'^) . Note  parenthetically  the  similarities 
between  the  matrix  equation  Eq.  (52)  and  the  previous!’'  developed  operator 
equation  (22) . 
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A final  alternative  to  (47)  in  solving  the  minimization  problem  (46) 
is  to  observe  that  (50)  and  (51)  can  be  rearranged  as 
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Upon  inversion  of  the  matrix  on  the  left  side  of  this  equation,  it  is  pos- 
sible to  solve  simultaneously  for  the  nodal-point  state  estimates  and  the 
corresponding  adjoint  variables.  However,  an  undesirable  feature  of  (53) 
is  that  it  involves  inversion  of  a matrix  of  dimensions  equal  to  twice  the 
nuaber  of  degrees-of-f reedom  in  the  finite-element  model.  An  additional 
possibly  undesirable  feature  of  (53)  is  that  the  matrix  that  must  be  inverted 
is  not  symmetrical,  whereas  both  (47)  and  (52)  involve  inversion 
of  symmetric  matrices. 

Interpolation  Within  Finlte-Elemen t s 


The  foregoing  provides  the  values  of  the  estimates  Xq  only  at  the  finite 
number  of  nodal  points.  Interpolation  in  regions  inside  the  finite- 
elements  is  provided  by  Eq.  (14),  which  has  been  used  originally  to  form 
the  stiffness  matrix  K in  (15)  associated  with  the  finite-dimensional  model. 

Covariance  Analysis  with  White-Noise  Model  Errors 


An  "a  priori"  covariance  analysis  of  the  estimation  errors  inherent  in 
the  state  and  model  error  estimates  (47)  above  can  be  conducted  in 
the  same  spirit  as  that  of  Secs.  6-7  of  Ref.  [1].  The  analysis  is  based 
on  the  assumption,  not  used  in  the  deterministic  least-squares  approach 
applied  to  arrive  at  (47),  that  the  actual  errors  e = [F,  N]  are  white-noise 
with  a unit  covariance-matrix.  In  other  words,  it  is  assumed  that  E(ecT)  = I, 
v/here  I is  the  appropriately  dimensioned  unit  matrix. 


It  is  convenient  to  first  review  the  relationships  between  actual  and 
estimated  errors  developed  in  Sec.  6 of  Ref,  [1]  as  they  apply  to  the 
finite-dimensional  problem  in  this  section.  Note  that  (47)  implies  that 
Eg  = We  .and  Gp  = (I  ~ W) e , where  W is  the  matrix  W = h-*- (hh'^)h^.  As  in  [1], 
W and  (l  - W)  are  projection  matrices  in  the  sense  that  = W and  (I  - W)^ 
= (I  - W) . This  Implies  that  the  optimal  error  estimates  Eq  and  the  model 
error  estimation  error  e = e - Eq  are  orthogonal  complements  since 

The  orthogona5lty  of  Eq  and  e„  implies  that  the  square  of  the 


norm  e'^e  of  a model  error  vector  e of  nohminimum  norm  can  be  expressed 


e'^E  = e'^  e. 


+ E-*-E  , a relationship  that  has  been  illustrated  graphically 


“ “ ■'O  “O  ■ “p“P’  “ “““  — o- 

in  Fig.  5 of  Ref.  [1]-.  These  are  the  key  relationship  required  to 
conduct  the  covariance  analysis  of  the  estimation  errors. 


The  covariance  matrix  of  the  model-error  estimation  error  Cp  = e - Eq 
is  given  by 
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(54) 


E[epEp]  = E[(I  - W)ee^(I  - W) ] = (I  - w) , 


or  in  more  explicit  notation 
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The  diagonal  terms  in  this 

last  equation  imply  that 

(55) 
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o 

1 
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= Tr[I  - K H (I  i-  HK  H ) HK  ], 

(56) 

E 

o 

1 

11  ^ = Tr  [I  - (I  + HK“V)"^], 

(57) 

which  correspond 


respectively  to  the  mean-square  estimation  error  of  the 

Use  of  the 


process-error  estimate  F and  the  observation  error  estimate, 
identity  [I  - K + 8k  -^]  = [I  + H^K  i 
in  the  following  more  convenient  expression  for  E]]  F - F [j 


E F - F 


= Tr  (I  + 


—1  T — 1 ■ 

K H H^K  ) 


(58) 


To  evaluate  the  state  estimation  error  X = X - X , note  first  that 
KX  = F and  KXp  = F imply  that  KX  = F„  and  Xp  = K“^F„?  Hence,  E[X;  X '^]  = 
K“1e  [FpFp]K“l  which  together  with  (58)  leads  to  ^ ^ 


X - X 


2 -1 
= Tr[K  (I 


_1  T -1  _1 

-1-  K HH  K ) K ]. 


(59) 


Note  the  similarity  between  (59)  and  its  infinite-dimensional  counterpart 
(67)  of  Ref.  [1] . 


Illustrative  Example 


Consider  again  the  example  in  Fig.  3.  By  application  of  the  finite- 
element  method,  it  is  possible  to  arrive  at  the  model  KX  = F,  where 
X = [xl,...,x^]  is  a vector  of  nodal  deflections;  K is  a stiffness 
matrix  given  in  (39);  and  F = [Fx,...,Fn]  is  a vector  of  applied  forces. 

This  truncated  model  can  be  obtained  readily  by  a process  quite  similar 
to  that  used  in  arriving  at  Eq.  (39).  The  interpolation  inside  elements 
is  provided  by  the  following  linear  interpolation,  u(5)  = x^  + (^  - g,) 

(xi+1  - xi  )/A,  for  X:  which  is  Identical  to  the  one  used  in  (38). 

The  estimation  problem  to  be  solved  consists  of  obtaining  an  estimate  of 
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the  static  deflection  and  of  the  applied  force  from  the  three  equally  spaced 
measurements  of  the  deflection. 

Fig.  5 shows  a comparison  between  the  deflection  estimates  obtained 
from  the  two  approaches.  The  key  difference  between  the  estimation  ap- 
proaches becomes  apparent  upon  inspection  of  Figs.  5(a)  and  5(b).  In 
Fig.  5(a),  the  estimated  deflection  is  obtained  by  first  solving  for  the 
nodal-point  deflection  estimates  and  then  using  a piecewise-1 inear  inter- 
polation to  arrive  at  the  estimates  within  the  f inite- element . On  the 
other  hand.  Fig.  5(b)  is  based  on  a piecewise  cubic  approximation  within 
each  element.  Whether  the  loss  accuracy  Inherent  in  the  lower-order 
approximation  in  Fig.  5(a)  can  be  overcome  by  use  of  a finer  subdivision 
of  the  domain  is  an  option  that  should  be  investigated  for  each  particular 
problem.  For  this  example,  the  intuitive  notion  has  been  verified  that 
increasing  the  number  of  nodes  in  the  finite-dimensional  approach  results 
in  estimated  solutions  that  become  closer  to  those  obtained  with  the 
infinite-dimensional  technique.  This  has  been  done  by  computing  the  root- 
mean-square  (RMS)  value  of  the  estimation  error  associated  with  the 
infinite-dimensional  and  finite-dimensional  solutions.  The  infinite- 
dimensional  solution  results  in  an  RMS  estimation  error  of  1.2  mm,  whereas 
the  finite-dimensional  approach  with  4 finite  elements  has  an  error  of 
5.9  mm.  Increasing  the  number  of  elements  to  12  results  in  an  estimation 
error  of  1.5  mm,  which  compares  quite  favorably  with  that  obtained  using 
the  infinite-dimensional  method. 


.25  .50  .75  1.00 

NORMALIZED  POSITION 


Fig.  5.  Comparison  of  Estimated  Solutions  With  Four  Finite  Elements 


Discussion 


The  k€iy  feature  of  the  finite-dimensional  approach  to  estimator  design 
is  that  it  is  based  on  a model  that  has  been  truncated  without  taking  into 
account  the  subsequent  problem  to  which  the  model  is  to  be  applied.  For 
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example,  Eq.  (15),  which  constitutes  the  finite-element  model,  was  devel- 
oped before  the  estimation  problem  in  (46)  was  formulated.  Selection  of  the 
grid  size  as  well  as  the  order  of  the  interpolating  polynomials  in  the 
approximation  subspace  has  been  made  without  prior  knowledge  of  the  esti- 
mation problem  (46).  In  the  case  of  the  Illustrative  example,  the  plecewlse-- 
linear  interpolation  (38)  is  not  necessarily  optimal  in  solving  the  estima- 
tion problem,  whereas  it  may  be  completely  satisfactory  to  obtain  a char- 
acterization of  the  structural  static  response.  Note  also  that  because 
the  finite-dimensional  approach  uses  a truncated  model  (15)  as  its  basic 
premise,  the  corresponding  estimation  problem  (46)  must  also  be  finite- 
dimensional.  This  finite-dimensional  problem  (46)  is  not  necessarily  the 
most  appropriate  estimation  problem  that  can  be  solved  to  obtain  the  best 
static  shape  estimates  for  a given  structure.  The  one  ingredient  missing 
in  (46)  that  is  important  to  the  problem  of  static  shape  estimation  is  the 
ability  to  account  for  the  behavior  of  the  structure  within  each  finite- 
element.  The  quadratic  functional  in  (16),  on  the  other  hand,  is  a more 
complete  criterion  because  through  its  last  term  (f,  f)„  and  by  use  of  the 
infinite-dimensional  model  (13),  it  is  able  to  account  for  this  behavior. 

In  some  cases,  the  difference  between  the  two  approaches  can  be  reduced 
where  it  is  feasible  to  select  a sufficiently  fine  finite-element  grid  in 
forming  the  finite-dimensional  model  (15).  In  other  words,  the  two  approaches 
can  probably  be  made  to  yield  substantially  the  same  results.  However,  even 
if  in  the  end  the  finite-dimensional  formulas  (47)  are  used,  it  is  convenient 
to  keep  in  mind  the  approximations  inherent  in  those  formulas. 


5.  APPLICATION  TO  A LARGE  ANTENNA  MODEL 


Simulations  are  currently  being  conducted  to  apply  the  foregoing  esti- 
mation methodology  to  the  problem  of  large  antenna  static  shape  determina- 
tion. While  both  approaches  are  under  development,  most  of  the  numerical 
work  to  date  has  focused  initially  on  the  finite-dimensional  approach. 

The  model  used  for  simulation  is  that  of  a 55-m  diameter  parabolic 
reflector  formed  by  a total  of  48  rib  elements  emerging  from  a common  cen- 
tral hub.  A membrane-like  mesh  is  supported  in  tension  by  the  rib  elements. 
Such  reflectors  have  been  under  recent  Intensive  investigations  for  poten- 
tial communication,  radiometry  and  radio-astronomy  missions  as  well  as  for 
possible  shuttle-attached  flight  experiments.  The  finite-element  model 
used  in  the  simulations  is  that  of  Ref.  [7]. 

The  simulations  involved  reconstructing  the  antenna  surface  from  a set 
of  discrete  measurements  as  shown  in  Fig.  6.  In  each  simulation,  a defor- 
mation of  a nominal  antenna  shape  was  produced  by  a linear  combination  of 
the  first  18  modes  provided  by  a finite-element  model  of  the  antenna,  as 
illustrated  in  Fig.  6(a).  The  estimator  performance  in  a batch  mode  was 
then  investigated  for  several  sensor-placement  configurations.  Fig.  6(b) 
shows  an  illustration  of  the  estimator  performance  for  a case  where  a total 
of  12  discrete  measurements  of  the  deflection  around  the  circumference  of 
the  reflector  are  available  for  estimation.  Parenthetically,  it  should  be 
pointed  out  that  a parallel  investigation  (based  on  methods  similar  to  the 
ones  outlined  in  this  paper)  of  the  problem  of  figure  control  has  also  been 
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under  way.  Illustrations  of  figure  control  performance  with  a total  of 
eight  actuators  mounted  at  the  central  hub  is  shown  in  Fig.  6(c).  Most  of 
the  estimation  results  presented  here  and  in  Ref.  [1]  have  a parallel  (dual) 
interpretation  in  terms  of  figure  control. 


DISTORTED  ESTIMATED  CORRECTED 


Fig.  6.  Shape  Estimation  and  Control  Performance 
for  55-ra  Diameter  Antenna  Model 


One  of  the  uses  that  can  be  made  of  the  "a  priori"  covariance  analysis 
of  the  previous  section  is  to  analyze  the  problem  of  sensor  placement.  For 
example.  Fig.  7 illustrates  the  sensitivity  of  the  mean-square  state  esti- 
mation error  as  given  by  Eq.  (59)  to  small  repositionings  of  a set  of  12 
sensors  mounted  on  the  circumference  of  the  dish.  Note  that  the  estimation 
error  is  significantly  reduced  by  a 5°  rotation  in  8 of  the  targets  at  the 
boundary  or  by  selecting  a nonuniform  placement.  The  reason  for  this  sensi- 
tivity is  the  presence  of  spatial  correlations  in  the  displacement  at  the 
boundary  of  the  type  discussed  in  detail  in  Sec.  5 of  Ref.  [1].  Sensitivity 
to  sensor  placement  is  one  of  the.  areas  currently  under  intensive  investi- 
gation as  will  be  discussed  in  more  detail  in  a forthcoming  publication  [7]. 
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Fig.  7.  Estimation  Error  Sensitivity  to  Sensor  Placement 


6.  SUMIARY  AND  CONCLUDING  REMARKS 


The  paper  provides  a complement  to  Ref.  [1]  by  describing  in  detail 
how  the  general  state  and  model  error  estimation  methodology  can  be  applied 
to  specific  structures.  For  structures  modeled  by  a single  PDE,  multiple 
PDF's  or  a fine-resolution  piecewise-continuum  model,  two  approaches  for 
estimator  design  have  been  developed.  The  first  approach  involves  the  solu- 
tion by  means  of  the  finite-element  method  of  the  boundary-value  problem 
associated  with  the  optimal  estimates.  The  second  approach  is  based  on  ob- 
taining a finite-element  model  first  and  then  conducting  the  estimator  de- 
sign based  on  the  truncated  model.  The  similarities  and  differences  between 
the  two  approaches  have  been  outlined  in  a general  framework  and  with  a 
simple  illustrative  example.  Simulation  results  of  the  application  of  the 
finite-dimensional  approach  to  the  problem  of  static  shape  determination  of 
a large  antenna  model  have  been  presented.  While  the  analytical  foundation 
for  both  approaches  has  been  established,  large-scale  numerical  simulations 
for  the  Infinite-dimensional  approach  are  still  under  development. 
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SESSION  VIII;  DISCUSSION  — FUTURE 
RESEARCH  OPPORTUNITIES 

Moderator:  H.A.  Rediess,  Milco  International,  Inc. 

Panel  Members:  A.V.  Balakrishnan,  University  of  California,  Los  Angeles, 

J.A.  Breakwell,  Lockheed  Missiles  and  Space  Company,  A.E.  Bryson,  Jr.,  Stanford 
University,  P.C.  Hughes,  University  of  Toronto,  and  W.E.  Vander  Velde,  Massachusetts 

Institute  of  Technology 

SYNOPSIS* 

The  intent  in  this  session  was  to  provide  views  on  the  state  of  distributed 
control  systems'  theory  and  application  and  to  draw  together  a collection  of 
suggested  research  opportunities  to  present  to  NASA.  Each  panel  member  gave  an 
opening  statement,  then  a general  discussion  with  the  audience  followed. 

Rediess  concluded  the  session  by  stating  a synthesized  set  of  recommendations. 

H.  A.  REDIESS:  In  Holt  Ashley's  talk  last  night,  there  was  one  particular  law 

of  Augustine's  that  he  presented,  the  one  about  it  taking  one-third  more  time  to 
complete  a task  than  is  estimated  at  any  point  in  time  in  the  project.  I think 
there  is  a corollary  to  that  of  sorts:  "Project  managers  always  freeze  the 

control  systeim  design  before  the  controls  engineer  feels  it  is  ready  to  be 
frozen."  In  spite  of  that,  the  systems  usually  work  reasonably  well.  There  are 
often  minor,  sometimes  even  major,  problems  that  are  found  in  ground  testing, 
in  closed-loop  tests,  in  vibration  tests,  or  on  into  flight  testing.  But  quite 
often  the  design  flaws  that  are  found  later  have  to  be  fixed  with  notch  filters 
or  other  type  of  changes  in  the  compensation.  And  this  goes  all  the  way  up  into 
the  flight-test  phase  of  the  program.  Changes  late  in  the  development  process 
are  usually  very  expensive,  particularly  if  we  are  talking  about  reverification 
of  the  software.  Many  times,  the  changes  that  one  would  like  to  make  because  of 
problems  found  well  down  into  the  process  do  not  get  made  because  it  is  just  too 
costly  or  too  time-consuming.  An  interesting  point  about  many  of  these  problems 
is  that  they  are  generally  not  a surprise  to  the  control  engineer.  The  problems 
often  arise  in  areas  where  he  knew  that  he  was  using  an  outdated  model.  Some- 
times the  structural  model  is  several  Iterations  back  in  the  configuration  by 
the  time  that  the  controls  person  gets  it  for  his  analysis  and  design  work. 
Another  possibility  is  that  the  controls  design  engineer  has  had  to  accept  a 
compromise  solution  in  his  design,  one  that  he  does  not  particularly  like 
because  of  constraints  that  are  beyond  his  control  or  because  he  has  literally 
run  out  of  time  or  resources  to  iterate  further. 

That  brings  me  to  my  point.  In  considering  partial  differential  equations 
(PDE's),  ordinary  differential  equations,  or  other  theory  in  this  workshop,  it 
seems  to  me  that  the  primary  thing  we  are  talking  about  is  providing  a set  of 
tools  which  designers/analysts  can  use  to  achieve  a better  design  at  the  point 
where  the  project  manager  freezes  it.  I mean  better  in  several  ways,  certainly 


*This  synopsis  attempts  to  capture  the  main  points  discussed  but  has  not  been 
reviewed  or  endorsed  by  the  speakers. 
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in  terms  of  just  being  able  to  achieve  a higher  degree  of  confidence  in  the 
design  when  it  goes  into  flight.  If  PDE  models  can  provide  a better  physical 
understanding  of  the  process,  as  suggested  by  Professor  Russell,  they  might  help 
us  to  recognize  essential  differences  of  the  control  process  in  the  early  con- 
ceptual design  phase  of  the  problem  in  which  you  would  like  to  do  some  parametric 
studies  and  interact  with  other  disciplines. 

If  PDE  approaches  can  provide  a better,  more  structured  approach  to  estab- 
lishing or  developing  robustness  in  the  system,  they  might  give  us  much  better 
confidence  that  the  final  system  will  perform  in  the  presence  of  component  uncer- 
tainties and  variations.  And  if  we  are  really  fortunate  and  come  up  with  a 
totally  different  design  approach  that  is  in  some  way  better  through  PDE's,  as 
suggested  by  Colonel  Washburn,  we  might  be  able  to  simplify  the  design  process 
greatly.  It  seems  like  there  are  several  good  theoretical  concepts  in  the  com- 
munity that  have  promise.  Most  of  these  will  not  probably  lead  to  practical 
tools,  but  maybe  a few  will.  That  is  the  nature  of  research. 

A.  V.  BALAKRISHNAN:  I formally  suggest  that  some  part  of  the  effort  of  large 
space  systems  research  be  directed  toward  the  use  of.  continuum  models.  To  me 
this  is  almost  an  obvious  statement,  but  after  listening  to  some  renowned 
experts  here,  who  apparently  do  not  recommend  this  point  of  view,  I will  add 
some  reasons  why  this  direction  may  not  be  totally  useless. 

In  the  continuum-model  approach,  the  structure  is  modeled  by  a partial 
differential  equation,  and  the  crucial  fact  is  that  the  optimal  controller 
design  is  based  on  the  PDE  model.  Then  that  optimal  system  is  approximated. 

This,  of  course,  is  in  contrast  to  discretizing  the  system  first. by  truncating 
at  the  modal  level  or  at  the  finite-element  level,  and  then  designing  a control 
system  for  that  truncated  model.  Everybody  knows  how  to  develop  designs,  at 
least  in  theory,  based  on  this  second  approach.  However,  in  doing  so,  you  are 
subject  to  what  I would  like  to  suggest  are  serious  drawbacks.  The  truncation 
in  the  second  approach  is  natural  if  you  are  considering  the  calculations  of 
the  open-loop  response.  But  if  optimal  control  is  your  design  objective  then 
it  is  not  necessary  to  solve  those  equations  characterizing  the  open-loop 
response.  One  can  go  directly  to  calculating  the  control  gains.  Discretizing 
first  can  obscure  fundamental  phenomena  as  referred  to  earlier,  such  as  the 
impossibility  of  guaranteeing  a stability  margin  unless  the  system  is  already 
exponentially  stable.  But  I think  the  most  compelling  reason  is  that  of  robust- 
ness which,  in  the  light  of  discussions  here,  is  clearly  one  of  the  important 
issues  in  the  control  of  large  space  structures.  It  is  generally  agreed  that 
there  are  always  uncertainties  in  the  model.  The  remedy  suggested  is  that  we 
have  on-line  identification  to  update  the  parameters.  In  that  case,  we  are 
then  talking  about  an  identification  phase  and  then  the  controller  based  on  the 
identification.  It  is  here  that  the  problem  of  modeling  becomes  crucial.  The 
one  advantage  that  the  PDE  model  has  is  that  of  simplicity.  This  simplicity  is 
more  than  a qualitative  simplicity.  It  is  reflected  in  the  fact  that  if  you 
estimate  parameters  of  the  PDE  model  rather  than  all  the  modes  and  the  modal 
frequencies,  it  is  far  more  efficient.  Furthermore,  we  do  have  now  considerable 
experience  on  such  on-line  identification.  So  the  point  is  that  if  robustness 
is  a main  criterion,  then  the  description  of  the  system  itself  should  be  simple 
enough,  because  the  larger  the  number  of  parameters  you  have  to  estimate,  as  is 
well  known,  the  more  the  error  you  introduce.  Moreover,,  you  only  need  to 
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estimate  those  parameters  that  affect  the  control  gain,  nothing  else.  I think 
we  can  perhaps  substantiate  this  claim  theoretically,  but  I know  that  will  not 
satisfy  many  of  you,  so  we  should  also  try  to  satisfy  this  in  practice. 

And  so  I think  that  some  part  of  the  large-space-structures  effort  should 
be  directed  toward  the  use  of  continuum  models.  I do  not  mean  merely  the  token 
support  to  universities  with  graduate  students  doing  the  work,  but  I mean  a 
real  substantial  practical  model,  the  results  of  which  would  be  acceptable. 

J.  A.  BREAKWELL:  I would  like  to  advocate  two  small  fields  that  I have  brushed 

up  against.  It  is  fairly  clear  that  there  is  an  immense  collection  of  tools 
assembled  to  attack  the  problem  and  I think  that  it  is  about  time  we  used  some 

of  them.  The  ACOSS  program  is  a good  example  of  the  customer  providing  a real 

model  and  requirements  to  the  controls  community  and  asking  them  to  solve  the 

problem.  That  is  a very  good  idea  and  NASA  would  do  well  to  follow  suit.  I 

also  advocate  small  experiments,  which  are  becoming  more  feasible  because  they 
are  becoming  cheaper.  The  reason  that  1 advocate  actually  getting  your  hands 
dirty  is  that  a lot  of  the  problems  that  are  anticipated  might  turn  out  to  he 
not  such  bad  problems  after  all,  but  you  might  find  other  problems  that  are 
more  important.  In  the  former  category,  I think  there  are  several  paper  tigers 
that,  once  you  have  a specific  example  in  mind,  tend  not  to  be  quite  as  big  a 
problem  as  you  might  imagine.  Global  stability  is  the  first  thing  that  comes 
to  mind.  Robustness  is  another  popular  subject  right  now  which  you  may  find  is 
not  as  much  of  a problem  when  you  go  to  experimentation. 

To  tell,  in  general,  what  kind  of  mode  you  want  to  keep  and  what  you  do  not 
want  to  keep  in  model  reduction  is  a rather  difficult  question.  But  when  you 
have  the  modes  in  hand  and  you  can  put  your  fingers  on  them,  I think  you  could 
tell  quite  easily.  So  those  are  the  category  of  problems  that  might  go  away 
once  you  start  doing  experimental  work.  There  will  also  be  other  problems  that 
come  up.  There  are  two  that  I have  seen  and  would  really  like  to  see  attacked. 
The  first  is  the  selection  of  the  weighting  matrices,  or,  for  filters,  the 
covariance  matrices.  There  is  not  much  that  has  been  said  about  this,  but  the 
assumption  of  a white-noise  covariance  in  a truncated  system  is  a fiction.  I 
would  like  somebody  to  develop  a rather  good  theoretical  test  procedure  for 
developing  covariance  inputs  to  the  filter. 

The  second  unexpected  problem  I have  seen  is  large  variations  in  the  low 
frequency  modes.  In  laboratory  experiments,  you  will  see  rather  large  varia- 
tions in  the  position  of  the  very  low  frequency  modes  well  within  your  filter 
bandwidth,  as  much  as  you  will  see  perturbations  in  the  first  unmodeled  modes 
(spillover).  That  is  a problem  that  I have  not  seen  addressed  theoretically. 

Having  taken  a very  pragmatic  approach  to  some  of  the  things  I advocate,  I 
would  now  like  to  turn  to  partial  differential  equations.  There  may  be  times 
when  it  is  appropriate  to  use  partial  differential  equations  to  model  large 
systems,  sometimes  not.  That  is  not  the  question.  The  partial  differential 
equation,  the  sophisticated  mathematics  that  we  have  seen  in  the  last  couple 
of  days,  provides  a much  more  convenient  formulation  of  the  problem.  It  allows 
a much  more  direct  attack  to  the  control  problem  and  to  the  modeling  problem. 

It  takes  away  the  burden  of  the  notation,  the  infinite  number  of  subscripts  that 
we  are  saddled  with  in  the  discrete  approach,  and  allows  you  to  think  about  the 
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problem.  It  is  entirely  conceivable  to  me  that  there  may  never  be  a successful 
application  directly  of  this  theory.  But  still,  if  any  breakthroughs  in  basic 
control  theory,  which  later  will  be  passed  along  to  the  engineers,  are  to  be 
made,  they  must  be  made  there.  There  is  no  chance  that  the  old  calculus  of 
variations,  or  the  Weiner  approach,  is  going  to  yield  very  much  more  in  the  way 
of  new  useful  theoretical  approaches.  I say  that  we  must  not  let  the  control 
theory  stagnate. 

A.  E.  BRYSON,  JR.:  Well,  I very  much  like  the  ideas  expressed  the  first  day  by 

Dick  Gran  and  Peter  Hughes.  I have  been  advocating  ideas  similar  to  those. 

There  is  a finite  bandwidth  for  any  system  that  you  are  trying  to  design.  The 
idea  of  having  an  infinitely  wide  bandwidth  is  sort  of  ridiculous.  When  you  do 
decide  What  the  disturbances  are  and  what  your  criteria  are  you  can  pretty  well 
establish  the  closed-loop  bandwidth  required,  and  any  structural  mode  that  has 
a frequency  of  say  10  times  that  does  not  have  anything  to  do  with  the  problem. 

I like  Dick  Gran's  idea  of  the  unstably  interacting  modes.  That  is  a very  nice 
concept  and  very  easy  to  understand.  The  ideas  of  Skelton,  Gran,  and  others  of 
leaving  out  terms  that  do  not  have  much  to  do  with  the  problem,  such  as  small 
residues  or  well-damped  or  very  high  frequencies,  is  also  very  important.  I 
think  we  have  made  a beginning,  but  only  a beginning,  on  the  problem  of  robust 
control  synthesis.  The  first  thing  you  have  to  realize  is  that  robustness  has 
a trade-off  with  performance.  If  you  want  to  have  a system  that  is  robust  over 
a wide  range  of  some  parameter,  you  are  not  going  to  have  as  good  a performance 
at  the  nominal  value  of  that  parameter.  We  have  a program  at  Stanford  now  that 
was  put  together  by  Leroy  Lee,  a young  man  from  Boeing  who  came  to  study  with 
us,  that  I think  is  a very  Important  forward  step  in  this  direction.  The  idea 
is  to  use  multiple  performance  indices.  If  you  have  parameters  that  vary  over 
a wide  range,  you  minimize  a sum  of  quadratic  performance  indices  evaluated  for 
the  different  values  of  those  parameters.  To  do  that  you  have  to  use  a non- 
linear programming  technique  for  the  parameters  and  feedback  gains.  With  this 
technique  you  can  get  output  feedback  or  low-order  compensator  designs  for 
rather  high-order  systems  very  rapidly  and  efficiently.  In  cases  where  a robust 
controller  design  gives  poor  system  performance,  you  are  going  to  have  to  go  to 
adaptive  control,  which  involves  some  kind  of  identification,  to  get  better  per- 
formance. One  of  the  first  steps  toward  actually  doing  this  with  a piece  of 
hardware  was  presented  in  the  paper  by  Eldred  and  Schaechter  yesterday,  which  I 
thought  was  a very  interesting  paper.  But  this  is  only  a beginning.  They  were 
identifying  one  frequency  and  the  problem  has  three  frequencies.  The  problem 
also  has  mode  shapes  which  affect  the  modal  control  distribution  matrices.  So 
we  have  got  some  way  to  go  yet.  Finally,  I certainly  agree  with  Breakwell  that 
ground  experiments  can  give  us  a lot  of  information  cost-effectively  as  compared 
to  putting  things  up  in  orbit.  We  ought  to  explore  that  much  harder  than  we 
have.  Vacuum  chambers  could  be  used  perhaps  to  get  rid  of  the  damping  due  to 
the  air.  Large  enough  experiments  should  be  used  so  that  we  do  not  have  big 
scaling  effects,  which  are  very  hard  to  extrapolate. 

P.  C.  HUGHES:  I think  one  possibly  can  identify  four  times  when  partial  differ- 

ential equations  are  appropriate  to  use.  The  first  one  is  when  you  are  dealing 
strictly  symbolically  with  the  equation.  We  saw  that  in  the  very  first  paper 
by  L.  Meirovitch,  the  last  paper  by  G.  Rodriguez,  and  many  papers  in  between. 

It  is  just  as  easy  to  write  down  the  differential  operator  as  it  is  to  write 
down  a matrix.  If  you  feel  more  comfortable  with  PDE's,  or  if  it  gives  you  an 
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analytic  result  that  you  are  aiming  for,  then  1 certainly  would  have  no 
objections  to  such  a thing. 

The  second  case  is  where  the  system  is  very  simple.  In  those  cases,  you 
can  write  down  partial  differential  equations  that  are  relatively  tractable  and 
even  get  solutions  to  them. 

Then  a third  possible  case  I can  see  is  when  one  has  a fairly  complicated 
structure,  but  one  wants  to  do  some  feasibility  and  parametric  studies.  This 
has  been  mentioned  by  other  people  and  we  have  seen  examples  of  it  at  this  work- 
shop --  for  example,  a large  antenna  reflector,  which  contains  a lot  of  ribs  and 
mesh.  If  you  neglect  a number  of  things,  you  can  get  an  approximate  partial 
differential  equation,  which  has  some  characteristics  that  are  nicer  than  a 
finite-element  model.  Namely,  the  PDE  model  usually  has  two  or  three  very 
simple  parameters,  which  makes  it  very  nice  for  parametric  studies.  But,  it  is 
usually  a rough  approximation,  unless  it  is  made  more  exact  for  the  first  two  or 
three  modes  by  comparison  to  some  more  accurate  model. 

The  fourth  case  is  where  one  has  dense  periodic  structures,  such  as  the 
example  presented  by  J.  N.  Juang  this  morning,  where  you  have  a truss  which  has 
hundreds  or  thousands  of  little  tiny  members  all  in  a periodic  lattice.  You  can 
smear  that  into  a continuum,  with  very  little  loss  in  accuracy. 

These  are  four  cases  for  which  I personally  feel  that  partial  differential 
equations  are  quite  reasonable.  The  thing  that  I am  trying  to  speak  out  against 
particularly  is  a feeling  expressed  by  some  people  that  partial  differential 
equations  are  more  advanced  than  discrete  models,  or  that  they  are  automatically 
more  accurate,  or  that  they  will  disclose  all  sorts  of  extremely  important  new 
phenomena  that  discrete  models  will  not. 

There  are  a couple  of  more. comments  on  control  of  large  space  structures 
that  I would  like  to  make.  I would  draw  an  analogy  with  cancer  research,  but 
not  in  the  way  you  might  think.  In  spite  of  all  the  significant  research  being 
reported  on  in  hundreds  and  hundreds  of  papers  every  year,  there  does  not  seem 
to  be  a tremendous  amount  of  progress.  I think  the  reason  for  that,  in  part, 
may  be  that  the  progress  is  not  necessarily  in  the  right  direction,  although 
that  is  very  hard  to  judge.  Another  reason  that  I think  we  may  not  be  making  as 
much  progress  is  that  the  problem  keeps  getting  harder,  because  we  keep  making 
it  harder.  Holt  Ashley  referred  to  it  several  times  last  night  in  his  talk, 
and  several  other  people  have  mentioned  it  at  this  workshop.  I would  like  to 
mention  some  specific  examples  of  where  we  are  making  the  problem  too  hard. 

The  first  one  is  disturbance  models,  which  are  not  mentioned  very  often  in 
technical  papers.  The  control  system  in  the  first  instance  is  supposed  to  con- 
trol against  the  disturbance.  In  fact,  the  disturbances  in  space  tend  to  be 
quasi-steady.  Solar  radiation  pressure  and  gravity  gradient  disturbances,  for 
example,  tend  to  have  a frequency  content  with  a period  of  one  orbit.  So  what 
is  causing  the  disturbance?  In  most  cases,  the  control  system  itself.  What 
happens  in  space  is  that  the  rigid-body  modes  are  the  things  that  are  unstable. 
They  are  very  gently  perturbed  by  quasi-steady  forces,  and,  in  an  attempt  to 
control  them,  we  unwittingly  or  unintentionally  excite  the  spacecraft  structural 
modes.  Therefore,  our  aim  should  be  to  control  rigid  modes,  without  exciting 
the  structural  modes. 
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Another  way  we  make  things  hard  for  ourselves  is  that  we  ignore  the 
inherent  stability  of  the  system.  Everybody  is  very  fuzzy  about  damping.  I 
would  say  half  of  the  control  papers  assume  zero  damping.  We  tend  to  Ignore 
simple  passive  solutions  like  passive  damping.  Composite  materials  provide  a 
tremendous  opportunity  to  design  passive  damping  ipto  the  system. 

Lastly,  we  make  things  difficult  with  our  actuator  schemes  that  tend  to 
excite  the  system  more  than  they  really  should.  If  the  disturbances  are  really 
so  low  in  magnitude,  and  are  so  nearly  constant,  then  the  ideal  actuator  would 
be  one  with  a very  low  force  and  very  nearly  constant.  I will  give  you  a very 
specific  example  of  that.  Suppose  you  are  north/south  station-keeping  one  of 
these  satellites.  The  major  force  on  the  satellite  is  due  to  the  moon.  The 
magnitude  of  the  force  is  very,  very  gentle  but  acts  continuously  over  a long 
period  of  time.  The  ideal  control  system  would  be  an  actuator  force  acting 
constantly  with  that  very  low  force  to  exactly  counteract  the  force  of  the  moon. 
We  do  not  do  that.  We  wait  until  the  motion  builds  up  and  then  we  fire  at  a 
node  of  the  orbit  with  a big  bang  and  get  everything  all  shaken  up. 

I would  like  to  make  specific  recommendations.  Firstly  — and  this  is  the 
wrong  group  to  mention  this  to  because  it  is  not  primarily  a structural  dynamics 
audience  — we  have  to  get  good  damping  models.  Secondly,  we  should  instrument 
all  possible  satellites  to  get  information  on  structural  modeling,  particularly 
damping  Information.  Thirdly,  we  should  continue,  and  intensify,  ground  experi- 
ments, particularly  the  ones  that  have  the  rigid-body  modes.  And  lastly,  we 
have  to  develop  some  very  benign  variable-force,  linear  actuators. 

W.  E.  VANDER  VELDE;  My  first  area  will  be  the  issue  of  dealing  with  unreliabil- 
ity in  these  control  systems.  Of  the  thirty  some  papers  that  were  presented  in 
this  workshop,  only  one  dealt  even  with  one  small  aspect  of  that  problem.  Maybe 
that  was  justified  because  the  principal  focus  of  this  workshop  was  on  distrib- 
uted system  issues.  But  this  was  also  essentially  true  of  the  other  workshops 
that  have  taken  place  during  the  course  of  this  summer.  I think  this  really  is 
a true  indication  of  the  amount  of  research  support  that  is  being  given  to  the 
question,  as  it  applies  specifically  to  large  space  structure  control.  There  is 
some  work  going  on  related  to  fault-tolerant  computers,  and  in  the  past  there 
has  been  a reasonable  amount  of  research  support  for  failure  detection  and 
isolation  as  it  applied  in  the  context  of  aircraft  flight  control  systems.  But 
the  large  space  structure  control  problem  really  is  different.  Take  failure 
detection  just  as  one  part  of  what  one  needs  to  do;  the  same  problems  of  inade- 
quate modeling  that  make  control  difficult  make  failure  detection  difficult 
also.  In  fact,  modeling  inadequacy  is  even  a more  serious  problem  in  failure 
detection  than  it  is  in  designing  a control  system. 

The  number  of  components  in  these  large  space  structure  controls  is  simply 
overwhelming  and  assures  that  we  will  suffer  failures.  It  is  clear  that  we 
have  to  do  something  about  it.  This  whole  area  is  one  that  ought  to  be  empha- 
sized much  more  in  NASA's  research  planning.  The  other  area  that  seems  to  me 
important  and  probably  Insufficiently  addressed  at  the  moment  is  the  question 
of  system  identification  after  deployment  in  orbit.  That  strikes  me  as  a very 
difficult  problem. 

For  some  cases,  it  may  not  be.  necessary.  However,  there  will  be  mission 
situations  wherein  the  a priori  Information  that  we  have  obtained  may  not  be 
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adequate.  I envision  an  Initial  period  In  orbit  during  which  the  system  is 
exercised  in  one  way  or  another  for  system  identification  which  would  allow  us 
to  tune  up  our  knowledge  of  the  system  dynamic  parameters. 

The  first  step  in  addressing  it  would  be  to  determine  the  most  efficient 
way  to  parametrize  the  system.  It  is  at  that  step  that  continuum  modeling  may 
have  a role  to  play.  It  strikes  me  that,  however  the  parametrization  is  done, 
there  may  well  be  thousands  of  parameters  to  fully  characterize  the  system 
dynamics  at  all  the  sensor  and  actuator  locations.  It  may  be  almost  an  unfea- 
sible problem  to  solve.  At  the  beginning  of  the  mission,  the  spacecraft  would 
only  have  minimal  capability  because  it  would  not  be  controlled  very  well  due 
to  model  uncertainties.  Most  of  the  onboard  data-processing  resources,  or  the 
telemetry  resources,  may  be  available  for  the  identification  problem  during  that 
period,  and  later  on,  of  course,  they  can  serve  what  is  ultimately  a more  useful 
function.  I would  like  to  suggest  that,  in  addition  to  that  initial  period  of 
full-up  identification,  there  be  a second  mode  of  identification  to  periodically 
monitor  the  dynamic  performance  of  the  system  throughout  the  mission.  It  would 
have  to  be  done  with  an  approach  which  is  far  less  demanding  on  computational 
resources.  The  idea  would  be  to  do  a test  which  would  indicate  when  something 
had  gone  wrong,  perhaps  Identifying  what  parameter  had  changed  enough  to  require 
a new  system  identification  test.  This  approach  is  very  much  like  the 
component-failure  detection  problem.  In  effect,  you  are  identifying  that  a 
particular  component  sensor  or  an  actuator  has  changed  its  performance  charac- 
teristics significantly. 

Comments  and  Questions  from  the  Audience 


D.  G.  WASHBURN,  Air  Force  W^eapons  Laboratory  (AFWL) : Dr.  Breakwell  has  recom- 

mended that  NASA  perhaps  go  the  DARPA  route  of  creating  a model  and  distributing 
it  to  the  community  for  control  of  large  space  structure  research.  I think  that 
is  a good  idea,  and  I am  planning  to  do  a similar  thing  for  AFWL  problems.  I 
would  recommend  that  they  create  a PDE  model  so  that  people  can  treat  it  with 
several  approaches,  including  NASTRAN.  The  current  DARPA  model  absolutely 
forces  modal  control,  because  that  is  the  only  thing  it  provides. 

I would  like  to  add  another  area  where  PDF's  might  be  useful:  that  of 

controller  design.  I certainly  do  not  advocate  that  an  exact  PDE  model  be 
built  for  any  structure.  But  just  from  a strictly  philosophical  point  of  view, 
our  controllers  are  extremely  simple  and  involve  10,  20  states,  or  whatever. 

The  controllers  finally  designed  are  generally  very  simple  and,  philosophically, 
it  might  be  appropriate  to  base  those  on  a simple  model  which  is  an  approxi- 
mate PDE. 

In  the  case  of  AFWL  large  space  structures  with  large  beam  expanders  for 
space  laser  applications,  there  are  very  real  disturbances  that  are  not  just 
created  by  the  controller.  I think  in  most  applications  you  have  disturbances 
Induced  by  slewing.  For  AFWL  applications,  slew- induced  disturbances  will  be 
severe.  There  is  also  broadband  coolant-induced  jitter  on  the  secondary  mirror, 
which  will  feed  into  the  primary  structure.  It  will  go  out  to  200  Hz  practi- 
cally flat.  So  you  will  have  continual  excitation  of  modes  up  to  at  least 
200  Hz.  The  whole  thing  is  hooked  to  a rocket  engine,  called  a laser,  sitting 
back  behind  it,  which  is  shaking  everything  pretty  violently.  So,  at  least  for 
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some  applications  there  are  genuine  broadband  disturbances  that  you  are  just 
going  to  have  to  deal  with. 

This  is  the  first  time  I have  come  in  contact  with  this  group,  but  it  is 
almost  unbelievable  that  nobody  has  sat  down  and  taken  the  simplest  possible 
example,  and  worked  through  and  designed  a controller  based  on  PDE  theory,  and 
then  designed  a modal  controller  and  compared  results  either  analytically  by 
simulation  or  experimentally.  We  are  all  offering  opinions  as  to  the  merits 
and  drawbacks  of  each  method,  but  we  have  not  even  worked  through  this  simple 
example  to  provide  a common  basis  for  comparison.  I recommend  that  NASA  under- 
take such  a task. 

Question  from  the  Audience  — Would  Dr.  Hughes  expand  on  his  reasons  not  to  use 
PDE?  I think  he  said  that  when  the  structure  is  simple,  then  the  PDE's  are 
okay,  but  when  the  structure  is  complex,  then  PDE's  are  not  useful.  There  are 
two  parts  to  my  question; 

a)  What  do  you  mean  by  complex?  Do  you  mean  a large  number  of  PDE's, 
or  a large  number  of  boundary  conditions  that  need  to  be  matched? 

Why  is  the  system  being  complex  a reason  not  to  use  the  PDE  model 
per  se?  Is  it  numerical  algorithm  problems? 

2)  Can  you  give  me  an  example  of  a project  or  some  reference  in  the 
literature  in  which  PDE  overkill  has  occurred? 

P.  C.  HUGHES:  Well,  there  are  several  sources  of  complexity:  one  is  complex 

boundaries;  a second  one  is  a large  number  of  individual  structural  members;  and 
a third  one  is  where  you  have  spatially  varying  inertia  or  stiffness  properties 
that  lead  to  variable  coefficients.  If  you  put  all  these  complexities  together, 
which  is  the  case  in  most  structures,  you  find  that  you  have  a real  mess  on  your 
hands,  and  I for  one  would  not  use  partial  differential  equations  to  get  what 
might  be  called  the  most  exact  model.  If  you  want  to  do  some  simple  approxi- 
mations and  get  an  approximate  model  for  some  feasibility  studies  or  perhaps 
some  simple  parametric  variations,  that  is  different.  After  you  have  got  these 
partial  differential  equations,  how  are  you  going  to.  solve  them?  Are  you  going 
to  use  hundreds  of  hyperbolic  sines  and  cosines,  or  what  are  you  going  to  do 
with  them?  Your  second  question  was,  have  I an  example  of  overkill?  No.  The 
people  that  advocate  them  are  smart  enough  not  to  use  them  in  cases  where  I 
would  not  use  them. 

A.  V.  BALAKRISHNAN:  May  I point  out  that  in  control  system  theory  we  do  not 

solve  PDE  equations.  That  is  a common  misunderstanding.  We  do  not  solve  par- 
tial differential  equations  in  control  systems  design,  any  more  than  we  do 
ordinary  differential  equations. 

A.  N.  MEYSTEL,  University  of  Florida;  I would  like  to  attract  attention  to  one 
of  the  problems  that  has  not  yet  been  mentioned.  All  of  our  control  systems 
work  well  for  stabilization,  but  any  slewing  will  create  something  unexpected 
and  all  these  stabilizers  will  be  ruined.  Any  slewing  of  position  of  an  antenna 
in  space,  especially  quick  changes,  will  require  a type  of  controller  that  we 
are  now  not  thinking  about. 


612 


Now  let  us  consider  another  matter.  The  door  of  the  shuttle,  as  we  all 
know,  warped  due  to  thermal  gradients.  The  same  will  happen  to  large  space 
structures.  Some  parts  are  in  the  sun,  and  some  parts  are  shadowed.  These 
temperature  fields  will  create  something  unexpected  and  difficult  to  model.  I 
believe  that  we  are  dealing  with  a future  set  of  systems  in  which  it  will  be 
impossible  to  determine  the  parameters  with  the  accuracy  required  to  meet  the 
performance  objectives.  It  may  be  necessary  to  use  a new  class  of  systems, 
called  intelligent  control  systems,  which  are  not  only  self-organizing  but  also 
involve  special  modeling  of  unpredictability  of  behavior.  There  are  several 
groups  in  the  United  States,  including  the  University  of  Florida,  who  are  work- 
ing in  this  direction. 

A.  N.  MADIWALE,  MIT  Lincoln  Laboratory:  I would  like  to  recommend  an  emphasis 

on  a particular  topic.  It  is  generally  acknowledged  that  there  are  going  to 
be  large  uncertainties  in  the  modeling  process.  There  should  be  more  effort  on 
stochastic  models  and  designs  that  account  for  uncertainties. 

R.  R.  STRUNCE,  Charles  Stark  Draper  Laboratory,  Inc.  (CSDL) : There  are  some 

very  important  lessons  to  be  learned  from  history.  The  concept  of  feedback, 
for  example,  was  motivated  by  a real  problem.  That  was  to  design  low-output 
distortion  for  transmission-line  amplifiers.  It  was  the  extensive  interaction 
between  laboratory  experiments  and  theoretical  development  that  resulted  in  the 
Bode  frequency  response  diagrams  and  the  Regeneration  Theory  by  Nyquist  for 
example.  If  we  are  going  to  resolve  some  of  these  large  space  structure  prob- 
lems, we  will  need  that  same  type  of  natural  interaction  between  laboratory 
experiments  and  the  theoretical  development.  Take,  for  example,  the  Explorer  I. 
It  is  intuitively  obvious  to  the  most  casual  observer  that  it  is  a relatively 
rigid  body.  Yet  those  four  very  flimsy  antennas  destabilized  that  vehicle  and 
resulted  in  a plethora  of  structural  dynamic  interaction  research.  I think  that 
some  of  these  very  large  space  structures  may  be  headed  for  some  unforeseen 
problems  that  could  result  in  catastrophic  failures.  This  particular  community 
must  seek  out  those  people  who  hold  the  purse-strings  within  DoD  and  NASA  and 
educate  them  as  to  the  necessity  for  an  evolution  of  integrated  experimental 
and  theoretical  development  to  give  us  the  necessary  knowledge  to  bring  these 
large  space  structures  to  fruition. 

Comment  from  the  Audience  — At  this  workshop,  it  has  been  tacitly  assumed  that 
you  will  eventually  discretize  to  implement  any  control  that  you  generate  with 
a PDE  analysis  or  some  other  distributed  system  analysis.  That  seems  to  be 
based  on  the  assumption  that  the  processing  will  be  digital.  The  evolution  of 
high-speed  digital  processing  has  been  overtaken  by  developments  in  optical 
processing  which  are  not  digital  or  discrete  in  time  at  all.  Is  it  not  possible 
that  the  estimators  and,  perhaps,  some  other  parts  of  the  control  might  remain 
in  the  continuous  domain  and  be  implemented  by  analog  processing? 

H.  A.  REDIESS:  The  question  of  real-time  processing  was  brought  up  earlier  by 

Gupta.  It  certainly  is  true  that  there  are  some  new  optical  processing  tech- 
niques that  perform  vector/matrix  operation  directly  in  real-time,  which  will 
make  certain  types  of  modern  control  theory  computations  very  attractive. 

A.  E.  BRYSON,  JR.:  In  Breakwell's  thesis,  he  took  the  partial  differential 

equation  approach  to  design  a controller  for  the  simple  beam  equation.  His 
full-state  feedback  solution  amounts  to  an  Integral  of  a gain  kernel  against  a 
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deflection  as  a function  of  distance  along  the  beam,  plus  another  integral  of 
a gain  and  the  velocities  times  the  velocities  along  the  beam.  So  the  problem 
becomes  one  of  finding  the  gain  kernel  as  a function  of  the  distance  along  the 
beam.  You  can  imagine  performing  this  integration  optically  with  a TV  camera 
and  an  optical  integration  device.  However,  if  you  compare  it  with  taking  a few 
finite  elements,  it  is  pretty  easy  to  convince  yourself  that  you  are  not  going 
to  get  a significant  improvement  going  into  this  integral,  with  a nice  smooth 
kernel,  compared  to  one  with  a few  discrete  elements. 

W.  E.  VANDER  VELDE:  One  application  that  might  be  a good  experiment  of  continu- 

ous modeling  would  be  the  electrostatically-shaped  mesh  antenna  that  we  heard 
about  Thursday  morning.  That  is  a case  in  which  the  geometry  might  be  simple 
enough  so  that  a PDE  description  would  be  tractable.  And,  in  fact,  you  could 
approximate  a distributed  force  on  that  antenna  quite  closely. 

H.  A,  REDIESS:  It  would  be  difficult,  if  not  impossible,  to  draw  a consensus 

viewpoint  from  a group  of  such  a diverse  nature.  I will  attempt  to  synthesize 
what  appear  to  be  the  most  generally  endorsed  recommendations  for  future  research 
resulting  from  this  workshop: 

1)  Develop  control  law  design  and  analysis  methodology  for  continuum 
and  piecewise-continuum  models  for  large  space  structures.  Addi- 
tional research  is  needed  to  mature  the  continuum  model  methodology 
to  a point  where  it  can  be  applied  effectively  on  realistic 
configurations, 

2)  Conduct  or  sponsor  a comparative  study  of  controllers  designed  using 
continuum  and  finite- element  models.  Use  a simplified  model  of  a 
realistic  structure  that  could  be  tested  in  a laboratory.  Compare 
the  results  of  both  designs  with  experimental  data  obtained  by  a 
third  party. 

3)  NASA  should  create  one  or  preferably  more  "benchmark"  generic  models 
representative  of  configurations  for  potential  future  NASA  missions 
that  can  be  used  to  develop  and  evaluate  system  identification  and 
control  law  methodologies.  Both  continuum  and  finite-element  models 
should  be  constructed  for  each  "benchmark"  configuration. 

4)  Develop  and  validate  system  identification  techniques  suitable  for 
identifying  a large  space  structure  in  the  initial  phase  of  operation 
and  for  continuous  monitoring  during  the  entire  mission  to  detect 
changes.  Both  continuum  and  finite-element  model  approaches  should 
be  investigated, 

5)  Conduct  and/or  sponsor  experiments  for  system  Identification  and 
control  law  methodology  development  and  evaluation.  Use  simple 
laboratory  experiments  to  evaluate  identification  techniques  and 
control  laws.  Consider  larger-scale  experiments  in  a vacuum  chamber 
to  minimize  the  effects  of  air  on  damping.  Use  all  possible  space 
flights  as  experiments  to  increase  knowledge  of  structural  dynamics 
in-  space,  e.g.,  instrument  satellites  or  the  Shuttle  Remote  Manipula- 
tor System  to  measure  damping.  Proceed  with  space  shuttle  flight 
experiments  with  flexible  structures. 
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6)  Develop  adaptive  and/or  reconf igurable  control  laws  for  large  space 
structures  to  compensate  for  model  errors,  spacecraft  configuration 
changes,  or  system  failures. 
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Jet  Propulsion  Laboratory 
California  Institute  of  Technology 
Pasadena,  CA  91109 


INTRODUCTION 

The  ability  to  stabilize  and  control  large  structures  in  space  is  vital  to 
establishing  efficient  earth-orbiting  systems  of  the  size  required  for  many  of 
the  future  applications  and  science  missions.  Large  space  systems'  configura- 
tions with  low  mass  densities  are  more  accurately  modeled  as  a continuously  dis- 
tributed mass  over  the  entire  structural  area  (distributed  parameter  system  or 
continuum  model)  rather  than  a sequence  of  finite  mass  elements  coupled  together 
(lumped  parameter  system  or  finite-element  model).  By  far,  the  most  effort  put 
forth  in  studying  these  challenging  control  problems  to  date  has  been  based  on 
the  finite- element  model,  which  can  be  addressed  using  ordinary  differential 
equations  control  theory.  Substantial  theory  now  exists  to  treat  distributed 
parameter  systems  and  some  effort  is  directed  towards  control  of  large  space 
structures.  This  approach  involves  formulation  of  the  problem  in  partial 
differential  equations  and  results  in  solutions  in  infinite-dimensional  Hilbert 
space  which  must  be  approximated.  Fewer  control  analysts  are  familiar  and 
comfortable  with  the  distributed  parameter  systems  approach;  hence,  it- has  not 
been  explored  to  the  same  degree  as  the  finite-element  approach. 

The  term  distributed  systems  was  used  in  two  different  senses  at  the 
workshop.  One  sense  was  control  theory  for  distributed  parameter  system 
modeling  as  discussed  above.  The  second  sense  was  control  theory  for  systems 
requiring  spatially  distributed  multipoint  sensing  and  actuation.  Many  large- 
space-structure  configurations  being  considered  will  require  multisensors  and 
actuators  distributed  in  some  fashion  throughout  the  structure.  One  must 
consider  the  spatial  distribution  of  sensors  and  actuators  whether  treating 
the  problem  from  the  finite-elements  or  continuum  model  approach. 

The  main  objectives  of  the  workshop  were: 

• to  provide  a forum  for  exchanging  ideas  and  exploring  the  application 
of  the  control  theory  for  distributed  systems  to  the  control  of  large 
flexible  multibody  spacecraft;  and 
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• to  identify  the  important  unsolved  problems  of  current  interest 

leading  to  possible  future  collaborative  NASA/university/industry  efforts. 

The  workshop  was  organized  with  several  sessions  addressing  the  major 
technical  Issues  through  invited  and  contributed  papers  by  leading  researchers , 
panel  discussions  and  a final  wrap-up  session  to  extract  the  key  unsolved 
problems  and  recommendations  for  future  research.  The  workshop  agenda  is 
presented  in  Appendix  A.  The  list  of  attendees  and  participants  is  presented 
in  Appendix  B. 

The  purpose  of  this  report  is  to  present  a technical  evaluation  of  the 
workshop  that  synthesizes  the  most  important  results,  conclusions  and  recommen- 
dations for  future  research.  The  future  research  recommendations  are  presented 
in  the  next  section  followed  by  the  technical  evaluation. 

FUTURE  RESEARCH  RECOMMENDATIONS 

The  major  recommendations  presented  here  are  believed  to  represent  a 
consensus  view  from  the  workshop  and  draw  heavily  on  the  results  of  the  wrap-up 
panel  discussion.  One  should  not  conclude,  nor  is  there  any  intent  to  imply, 
that  all  participants  of  the  workshop  or  the  wrap-up  panel  discussion  endorsed 
these  recommendations.  However,  the  recommendations  listed  under  the  heading 
of  "Major"  were  those  summarized  at  the  end  of  the  wrap-up  panel  discussion  and 
were  generally  endorsed  by  the  workshop  participants . 

Several  additional  recommendations  that  were  made  by  various  participants 
are  listed  under  the  heading  of  "Other  Recommendations."  Some  of  those  do  not 
address  directly  the  theme  of  the  Workshop,  but  do  apply  to  large  space  systems 
and  were  felt  to  be  of  sufficient  importance  to  Include. 

Major  Recommendations  for  Areas  of  Future  Research 

1.  Control  law  design  and  analysis  methodology  for  continuum  and  piecewise 
continuum  models  for  large  space  structures . 

Rationale:  Proponents  of  the  continuum  model  approach  believe  that  it  will 

provide  a better  physical  insight  to  the  control  problem  and  solution  since  it 
is  based  on  physical  laws  governing  structural  dynamics  rather  than  a functional 
representation  of  the  dynamics  that  may  obscure  some  fundamental  properties 
of  the  problem,  e.g.,  stability  and  robustness.  It  may  be  a better  way  to 
assure  robustness  of  the  controller  design  because  it  does  not  arbitrarily 
limit  the  number  of  modes  included  as  in  the  finite-element  approach.  Further- 
more, the  proponents  state  that  the  mathematical  formulation  of  the  model  and 
control  problem  is  simpler,  even  for  piecewise  continuum  models  in  which  there  are 
combinations  of  distributed  and  lumped  parameters.  Additional  research  is 
needed  to  mature  the  continuum  model  methodology  to  the  point  where  it  can  be 
applied  effectively  on  realistic  configurations. 
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2.  Comparative  study  of  controllers  designed  using  continuum  and  finite-element 
models . 


• Use  a simplified  model  of  a realistic  structure  that  could  be  tested 
In  a laboratory. 

• Have  each  controller  designed  by  experts  in  the  corresponding  method. 

• Compare  results  of  both  with  experimental  data  obtained  by  a third 
party . 

• Assess  results  based  on  controller  performance  robustness,  plant 
identif lability , computational  efficiency  and  other  appropriate  measures. 

Rationale:  A well-structured  study  of  this  nature  is  needed  to  assess  the 

relative  advantages  and  disadvantages  of  each  approach.  Many  claims  and 
concerns  are  voiced  at  workshops  and  conferences  about  both  approaches  without 
appropriate  comparative  data  to  substantiate  the  positions  voiced. 

3.  NASA  should  create  one  or  preferably  more  "benchmark"  generic  models 
representative  of  configurations  for  potential  future  NASA  missions  that  can  be 
used  to  develop  and  evaluate  systems  identification  and  control  law  methodologies. 

• Both  continuum  and  finite-element  models  should  be  constructed  for 
"benchmark"  configuration. 

Rationale : There  are  currently  no  realistic  models  available  for  potential 

NASA  missions  that  are  developed  in  sufficient  detail  to  use  in  advanced  iden- 
tification and  control  theory  for  both  continuum  and  finite-element  models.  The 
DARPA  ACOSS  models  are  finite-element  models  and  preclude  the  continuum  model 
approach.  If  such  models  were  developed  and  distributed  to  universities  and 
industry  as  well  as  NASA  centers,  a large  amount  of  research  would  be  focused  on 
NASA's  type  of  problems. 

4.  Systems  identification  research  for  large  space  structures. 

• Apply  to  realistic  type  configurations  (NASA  benchmarks , for 
example) . 

• Consider  both  continuum  and  finite  element  model  approaches. 

• Obtain  experimental  space  flight  data  to  assess  methods  . 

• Develop  techniques  suitable  for  identifying  a space  structure  in  the 
initial  phase  of  operation  and  for  monitoring  it  during  the  remainder 

of  the  mission  to  detect  changes. 

Rationale : It  will  not  be  possible  to  obtain  valid  experimental  test  data  on  the 

ground  for  large,  flexible  space-structures.  It  is  highly  likely  that  modeling 
errors  of  a structure  to  be  deployed  in  space  will  be  sufficiently  large  to  ser- 
iously affect  the  controller  performance.  Appropriate  systems  identification 
techniques  must  be  developed  and  assessed  with  whatever  limited  flight  data  that 
are  available  before  they  are  needed  in  an  actual  mission.  (See  Appendix  C for 
an  expanded  statement  on  systems  identification  research  by  Professor  W.E.  Vender 
Velde  of  the  Massachusetts  Institute  of  Technology  (MIT) . 


621 


5.  Experiments  for  systems  Identification  and  control  law  methodology  develop- 
ment and  evaluation. 

• Simple  laboratory  experiments  to  evaluate  identification  techniques 
and  control  laws . 

• Consider  larger-scale  experiments  in  a vacuum  chamber  to  minimize  the 
effects  of  air  on  damping. 

• Use  all  possible  space  flights  as  experiments  to  increase  knowledge 
of  structural  dynamics  in  space,  e.g.,  instrument  satellites  to  measure 
damping . 

• Consider  using  the  space  shuttle  Remote  Manipulator  System  (RMS)  in 
an  on-orbit  experiment  to  compare  predicted  dynamic  characteristics  to 
flight  data. 

• Proceed  with  space  shuttle  flight  experiments  with  flexible  structures 
(i.e.,  MAST,  SAFE,  DAFE,  and  SADE)  to  obtain  structural  dynamics  data  and 
experience  with  stabilization  and  control. 

Rationale:  Experimentation  is  the  surest  way  to  develop  confidence  in  design 

and  analysis  methodologies.  Simple  laboratory  experiments  can  be  very  useful 
in  understanding  the  control  problem.  Some  anticipated  problems  turn  out  not  to 
be  problems  at  all  and  other  new  problems  may  arise  that  were  not  anticipated. 

It  is  extremely  important  to  determine  the  damping  of  structures  in  space  because 
of  the  effect  on  the  controller  design.  Tests  in  a vacuum  chamber  may  help 
get  some  initial  data.  Ultimately,  it  is  necessary  to  obtain  space  flight  data. 
Both  continuum  and  finite-element  models  should  be  used  in  the  analyses,  control- 
ler designs  and  systems  identification  methods  associated  with  the  flight 
experiments . 

6.  Adaptive  and/or  reconf igurable  control  laws. 

• Adaptive  control  laws  to  compensate  for  model  errors,  spacecraft 
configuration  changes  or  system  failures. 

• Reconf igurable  control  laws  to  compensate  for  spacecraft  configuration 
changes  or  system  failures . 

Rationale:  Certain  large  structures  considered  for  space,  such  as  a modular 

space  platform,  would  have  major  configurational  changes  over  its  lifetime. 
Modules  will  be  transported  to  space  via  multiple  shuttle  flights,  then 
assembled  in  space.  Once  a basic  platform  is  assembled,  other  masses  may  be 
attached  periodically,  such  as  the  shuttle  docking  with  the  platform,  attaching 
an  Orbital  Transfer  Vehicle  (OTV),  adding  or  subtracting  fuel,  adding  or  sub- 
tracting experiments  and/or  satellites.  The  inertial  characteristics  and 
dynamics  would  vary  widely  with  these  changes.  Control  laws  that  self-adapt  or 
that  reconfigure  (either  automatically  or  through  manual  software  changes) 
should  be  investigated  that  could  cope  with  these  wide  variations . A second 
motivation  is  to  compensate  for  failures  in  the  control  system.  There  are  likely 
to  be  a large  number  of  components  in  a large  space  structure  control  system. 

If  a component  fails,  or  if  a fault  occurs  in  software  that  can  affect  the  con- 
troller performance,  it  may  be  possible  to  reconfigure  the  remaining  healthy 


622 


elements  of  the  system  to  maintain  adequate,  though  nonoptimal,  control  until 
the  faulty  elements  can  be  corrected.  An  alternate  approach  and  further  discus- 
sion of  reliability  issues  are  presented  in  the  next  recommendation. 

7.  Control  system  fault  tolerance  research. 

• Choice  of  number,  location  and  control  laws  to  accommodate  likely 
faults  over  the  expected  operational  lifetime  (including  effects  of 
periodic  component  repair  or  replacement). 

• Systems  performance  and  fault  monitoring  (hardware  and  software)  , 
failure  detection  and  isolation  (FBI),  fault -tolerant  processing  and  data 
distribution  and  control  law  reconfiguration. 

Rationale : The  control  and  stabilization  system  for  large  space  structures  is 

likely  to  have  several  hundreds  of  components  dispersed  over  the  structure  and 
linked  together  by  some  form  of  data  distribution  networks . Even  with  extremely 
reliable  components , there  are  bound  to  be  numerous  component  failures  or  even 
software  faults  that  have  the  same  effect  as  a hardware  failure  over  the  life- 
time of  the  spacecraft.  The  system  must  have  the  capability  of  accommodating 
these  faults  in  some  manner.  (See  Appendix  D for  an  expanded  statement  of 
reliability  issues  by  Professor  W.E.  Vander  Velde  of  MIT.) 

Other  Recommendations 

1.  Finite-dimensional  compensator  design  methodology 

• Model  order  reduction 

• Reduced  order  compensator  design 

• Robustness  techniques  with  truncated  modes 

• Direct  digital  design  and  implementation 

• Fixed- form,  low-order  compensator  design  via  nonlinear  programming 

2 . Sensor  and  actuator  technology 

• Very  low  frequency 

• Very  broad  bandwidth 

• Proportional  actuator  from  very  small  to  large  forces  (benign 
actuators ) 

• Long  life,  low  mass,  and  low  power 

3.  Real-time  processor  control  laws 

• Need  10  to  100  mllllon-operatlon-per-'second  capability 

• High  reliability 

• Software  languages,  verification,  validation,  and  fault  tolerance 
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4.  Hardware-in-the-loop  simulation 

• Digital,  analog  or  hybrid 

• Partial  or  complete  system 

5.  Modeling  accuracy  for  large  space  structures 

• Upper  atmosphere  model 

• Disturbance  model 

6.  Rendezvous  and  docking  control  and  guidance 

7.  Control  laws  for  very  large  amplitudes  due  to  slewing  or  major  disturbance, 

8.  On-orbit  vibration  tests  of  operational  spacecraft  to  determine  model 
characteristics  similar  to  ground  vibration  tests  on  aircraft. 

9.  Systematic  method  for  selecting  the  weighting  matrices  in  estimation  and 
filtering  problems . 


TECHNICAL  EVALUATION  OF  WORKSHOP 

The  workshop  must  be  considered  an  unqualified  success.  There  were  over  150 
attendees  and  participants  (see  Appendix  B)  who  included  most  of  the  key  re- 
searchers in  the  U.S.  concerned  with  this  subject.  In  general,  the  technical 
papers  were  of  excellent  quality  and  pertinent  to  the  theme.  Structuring  the 
workshop  to  have  discussion  panels  each  day  was  an  Important  factor  to  achieving 
lively  participation  of  the  audience  in  discussions. 

The  workshop  was  designed  to  stimulate  a strong  exchange  of  views  between 
two  segments  of  the  control  research  community:  those  supporting  the  use  of 

finite-element  models  and  those  supporting  the  use  of  continuum  models.  It  was 
very  effective  in  doing  just  that.  Strong  viewpoints  were  expressed  on  the 
advantages  and  disadvantages  of  both  approaches.  It  appeared  to  bring  out  all 
the  major  issues  associated  with  applications  of  distributed  system  theory  to 
the  control  of  large  space  structures.  The  workshop  was  also  very  effective  in 
surfacing  important  unsolved  problems  worthy  of  future  research. 

A very  stimulating  and  entertaining  after-dinner  talk  was  given  by  Professor 
Holt  Ashley  of  Stanford  University.  He  drew  heavily  on  his  experience  in  air- 
craft structural  dynamics  and  aeroelasticity  to  project  what  he  believes  are  the 
key  factors  to  address  in  large,  flexible  space-structures. 

The  following  sections  discuss  each  session  of  the  workshop  in  sequence. 

The  final  section  gives  some  general  observations  and  comments  developed  over 
the  three-day  workshop. 
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Session  I : Modeling  and  Control 


This  session  addressed  the  fundamental  questions  of  mathematical  modeling 
of  large  space  structures  for  control  purposes.  The  papers  were  largely 
tutorial  in  nature  and  identified  what  the  authors  believed  to  be  the  major 
features  of  each  modeling  and  control  law  design  approach. 

Meirovitch's  paper  recommends  the  Independent  modal-space  control  (IMSC) 
method,  which  was  illustrated  for  control  of  distributed  and  discretized  struc- 
ture models.  Several  advantages  are  listed  for  the  IMSC  approach  as  compared 
to  coupled  controls,  including  lower  control  energy  and  provable  robustness. 
Robustness  was  only  discussed  in  terms  of  spillover  modes  and  not  with  respect 
to  uncertainties  of  changes  in  the  overall  structure  or  controller.  It  would  be 
of  Interest  for  the  IMSC  method  to  be  applied  to  a more  realistic  example. 

Hughes'  paper  presents  strong  arguments  against  the  "Infinite  Modes 
Assertion"  as  being  the  true  characterization  of  structures,  in  terms  of  both 
its  importance  and  the  assertion  itself.  It  is  a very  readable  exposition  on 
what  the  author  believes  are  absurd  assertions  made  by  certain  control 
theorists  and  applied  mathematicians:  that  physical  structures  actually  have 

an  infinite  number  of  modes.  Much  of  the  argument  centers  around  the  concept 
of  a mathematical  model  vs.  the  real  physical  structure.  The  main  point 
is  that  models  based  on  partial  differential  equations  or  ordinary  differential 
equations  are  both  mathematical  approximations  to  the  real  structure  and  either 
is  only  as  good  an  approximation  as  is  shown  to  be  useful  in  a practical 
application.  The  paper  expresses  the  viewpoint  that  a modal  representation  with 
relatively  few  modes  is  the  most  effective  method  for  realistic  large  space 
structures.  A very  effective  criterion  for  selecting  the  modes  to  include 
in  an  analysis  of  control  design  problems  was  presented.  The  paper,  however, 
does  not  seem  to  recognize  that  a continuum  model  may  be  a better  mathematical 
model  of  the  physical  system  than  a finite-element  model  for  certain  situations. 
The  author  did  acknowledge  that  point  in  a later  panel  discussion. 

Gran's  paper  presents  an  interesting  middle  ground  position  between  the 
continuum  and  finite- element  model  enthusiasts.  The  paper  advocates  designing 
large  space  structures  using  finite  element  methods  and  verifying  the  design's 
robustness  using  a continuum  model.  The  paper  presents  a concept  of  stable 
and  unstable  interactive  modes  that  is  very  important  in  selecting  which  modes 
should  be  retained  in  a truncated  model.  A pole-zero  set  near  the  jm-axis  in 
the  complex  plane  is  unstably  interacting  if  the  zero  is  above  the  pole.  The 
paper  states  that  stably  interacting  modes  with  small  contribution  to  the  cost 
may  be  discarded,  but  beware  of  unstably  Interacting  modes  because  of  potential 
robustness  problems.  Several  other  practical  suggestions  and  guidelines  are 
presented  for  designing  satellite  control  systems  in  a systematic  way . 

The  final  paper  of  the  session  by  Hamidl,  Rodriguez  and  Schaechter  presents 
a solid  case  for  modeling  complex  large  space  structures  as  coupled  rigid  and 
flexible  bodies  using  discrete  models  for  the  rigid  bodies  and  continuum  models 
for  the  flexible  bodies.  Contrary  to  statements  by  other  participants  that  one 
could  not  use  a continuum  model  for  complex  spacecraft,  this  paper  shows  that  a 
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very  simple  and  compact  mathematical  model  can  be  constructed.  The  model 
retains  sufficient  dynamics,  provides  good  physical  insight  to  the  problem,  and 
is  suitable  for  parametric  studies.  The  technique  was  illustrated  by  modeling 
a 55-meter  wrap-rib  antenna  with  an  L-shape  boom  connecting  the  dish  to  the 
feed.  The  model  was  in  good  agreement  with  a finite  element  model  and  required 
very  little  computational  effort. 

A participant  from  the  audience,  Larry  Weisstein,  supported  the  idea  of 
continuum  models  for  even  complex  space  structures.  He  referred  to  his  masters 
thesis*  which  presents  a continuum  modeling  approach  for  lattice-type  structures 
that  are  very  common  elements  in  space  systems.  The  reference  also  includes 
an  extensive  bibliography  of  continuum  models  for  space  structures. 

Session  II:  Control  and  Stabilization 

Session  II  consisted  of  eight  papers  presenting  a series  of  examples  of 
modeling  and/or  control  for  a variety  of  configurations.  Rather  than  reviewing 
each  paper  individually,  some  of  the  major  points  made  are  highlighted  here. 

Bryson's  paper  makes  a simple,  but  potentially  very  important,  point.  If 
a flexible  structure  has  a plane  of  symmetry,  it  may  be  possible  to  split  the 
control  problem  into  two  uncoupled  sets.  The  example  used  resulted  in  a greatly 
simplified  task  of  synthesizing  the  active  control  logic. 

The  paper  by  Hamidi  and  Manshadi  addresses  developing  a control  law  that 
optimizes  the  overall  performance  objective  of  a large  space  antenna,  that  is, 
the  resulting  RF  pattern,  rather  than  an  intermediate  objective  of  controlling 
dish  shape  and  feed  position.  By  including  an  RF  electromagnetic  model  as 
well  as  the  structural  model  in  the  design  loop,  the  control  of  the  relative 
feed/dish  motion  can  be  synthesized  to  maximize  the  peak  electric  field. 

Considerable  discussion  was  generated  over  Henderson's  presentation  of  the 
ACOSS**  Model  #2.  Most  participants  were  generally  familiar  with  ACOSS  and 
endorsed  the  idea  of  such  models.  Several  requests  were  made  about  how  to 
acquire  the  model.  Interested  parties  can  contact  Timothy  C.  Henderson  at  the 
Charles  Stark  Draper  Laboratory  (CSDL).  The  discussions  led  to  recommendations 
that  NASA  develop  similar  Benchmark  models  for  its  potential  missions. 

Two  realistic  examples  of  distributed  control  of  large  space  antennas 
(hoop/column  and  wrap-rib)  were  presented  in  the  final  paper  of  this  session 
by  Cameron,  Hamidi,  Lin  and  Wang.  They  found  that  distributed  sensing  and 
control  provides  significant  performance  improvement.  The  study  also  indicates 
that,  for  the  linear  optimal  controller,  system  parameter  errors  could  result 


*Weisstein,  Larry  S.,  Continuum  Models  for  Repetitive  Beamlike  Lattice 
Structures,  masters  thesis,  George  Washington  University,  Washington,  D.C., 
1982. 

**ACOSS  stands  for  Active  Control  of  Space  Structures,  which  is  a DARPA- 
sponsored  program. 
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In  significant  performance  degradation  and  even  instability.  The  paper  recom- 
mends the  use  of  in-flight  systems  identification  for  certain  critical  model 
parameters. 


Session  III:  Discussion  - LSS  Control  Problems 

The  panel  discussion  was  aimed  at  further  definition  of  the  control  prob- 
lems of  large  space  structures.  The  moderator  was  R.R.  Strunce  of  CSDL  and  the 
panel  members  were:  J.  Sesak  of  General  Dynamics,  N.K.  Gupta  of  Integrated 
Systems,  and  A.F.  Tolivar  of  the  Jet  Propulsion  Laboratory  (JPL) . The  panel 
and  participants  from  the  audience  focused  primarily  on  what  they  believed  to 
be  the  major  technology  needs  and  recommended  research.  The  following  are  what 
appeared  to  be  the  most  important  comments: 

• The  control  theory  and  control  law  design  methodology  have  not  yet 
been  developed  to  the  stage  where  one  could  design  with  confidence  a 
stabilization  and  control  system  for  a large,  flexible  space-structure 

to  meet  both  the  performance  and  robustness  requirements.  There  are  still 
issues  of  modeling,  model  order  reduction,  robustness  of  stability,  and 
systems  identification.  It  will  be  necessary  to  demonstrate  the 
technology,  probably  in  the  flight  environment,  to  assure  confidence  in 
the  methods. 

■ There  are  very  little  experimental  data  for  identification  and  control 
of  large,  flexible  space-structures.  The  data  available  are  for  simple 
experiments  which  are  not  very  representative  of  a real  space  structure. 

It  is  extremely  important  to  obtain  better  experimental  data  from  more 
realistic  ground  tests  and  on-orbit  flight  tests. 

• The  robustness  problem  has  not  been  solved  when  one  considers  the 
total  problem  including  actuator  and  sensor  dynamics  and  realistic 
parameter  uncertainties.  The  problem  of  uncertainty  management  through 
systems  identification  techniques  and/or  adaptive  control  should  be  given 
more  attention. 

• There  is  a real  need  for  research  to  develop  actuator  and  sensor 
technology  for  low  frequency  operations  that  are  appropriate  for  space 
applications.  Actuators  will  be  needed  with  force  ranges  from  very  small 
to  very  large. 

• Many  of  the  control  concepts  being  studied  for  large-scale  systems 
would  challenge  the  real-time  processing  technology.  Better  analytical/ 
numerical  techniques,  as  well  as  much  higher  processing  capacity  devices, 
must  be  developed.  It  may  be  necessary  to  have  computational  capability 
of  from  10  to  100  million  operations  per  second.  It  is  not  clear  that 
VHSIC  (Very-High-speed  Integrated  Circuits)  will  even  solve  this  problem. 

• More  attention  must  be  paid  to  the  question  of  assuring  adequate 
performance  of  a stabilization  and  control  system  in  the  presence  of 
component  failures.  Systems  must  be  made  fault- tolerant  to  some  level. 
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Session  IV:  Distributed  Control 

Session  IV  continued  addressing  the  issues  involved  in  control  and  sta- 
bilization of  large  space  structures. 

The  paper  by  Vander  Velde  is  an  excellent  treatment  of  a very  important 
practical  aspect  of  distributed  control  of  large  space  structures,  that  of 
selecting  the  sensor  set  (number  and  location)  considering  possible  failures. 

A methodology  is  presented  that  would  assist  the  designer  in  the  process  of 
selecting  the  sensor  set  that  maximizes  the  degree  of  observability  over  the 
operating  lifetime  of  the  system  accounting  for  the  likelihood  of  failures. 

The  sensor  set  that  one  selects  considering  the  possibility  of  component  failures 
can  be  quite  different  from  that  one  would  select  considering  only  optimizing 
performance.  The  author  has  treated  the  companion  topic  for  actuators  in  a 
separate  paper  presented  elsewhere.  A discussion  of  reliability  issues  by 
Vander  Velde  is  presented  in  Appendix  D of  this  report. 

J.A.  Breakwell's  paper  presents  a strong  case  for  using  laboratory  experi- 
ments for  gaining  a better  understanding  of  the  control  problem  with  flexible 
structures.  Through  his  experiment  with  TOYSAT,  Breakwell  found  that  some 
problems  he  anticipated  from  theory  never  materialized,  but  new  ones  arose. 

The  TOYSAT  experiment  was  designed  to  test  hypotheses  concerning  the  application 
of  optimal  control  theory  to  flexible  spacecraft.  Breakwell  suggests  that 
with  the  relatively  low  cost  of  microcomputers,  almost  anyone  can  afford  to 
conduct  simple  laboratory  experiments  that  can  be  very  useful. 

The  last  paper  of  this  session,  by  Schaechter  and  Eldred,  also  addresses 
laboratory  experimentation  for  verification  of  control  laws.  Two  major  points 
demonstrated  are  that  model  parameter  errors  can  cause  an  optimal  control 
design  to  drive  a system  unstable  and  that  adaptive  control  can  stabilize  a 
system  with  an  uncertain  model.  The  experiment  proved  useful  in  validating 
the  analytical  techniques. 

Two  special  kinds  of  flexible  space  structures  were  analyzed  by  J.V. 
Breakwell:  orbiting  chains  and  rings.  Another  special  case,  vibration 

control  of  a tethered  satellite,  was  discussed  in  the  paper  by  Xu  et  al. 

Session  V:  Control  Theory  for  Distributed  Systems 

Nine  papers  were  presented  in  this  session  which  reviewed  and  treated 
primarily  control  theory  of  distributed  parameter  systems  and  approximation 
methods  to  achieve  finite-dimensional  controllers.  This  session  and  the 
following  discussion  panel  stimulated  a lively  discussion  of  the  pros  and  cons 
of  partial  differential  equation  (PDE)  control  theory  for  large,  flexible  space- 
structures. 

An  excellent  review  of  the  control  theory  of  hyperbolic  systems  was  given 
by  Russell.  The  paper  covers  important  questions  of  controllability,  stabili- 
zation, robustness,  canonical  forms  and  eigenvalue  placement.  The  paper 
presents  a very  promising  method  for  canonical  compensation  for  hyperbolic 
systems . 
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Gibson's  paper  illustrates  how  conceptually  easy  it  is  to  treat  continuum 
systems  in  infinite-dimensional  space  rather  than  being  more  complex  as  some 
have  suggested.  Once  the  infinite-dimensional  compensator  is  designed  concep- 
tually, then  a finite-dimensional  approximation  is  made.  This  approach  provides 
more  insight  into  the  nature  of  the  control  problem  than  the  finite-element 
approach.  A simple  example  is  presented  to  illustrate  the  method.  It  now 
needs  to  be  applied  to  a model  more  representative  of  real  space  structures. 

The  paper  by  Hyland  and  Madiwale  presents  a new  look  at  a fixed  order 
controller  design  method  which  allows  for  incorporating  system  parameter 
uncertainties  directly  in  the  design  process.  It  is  based  on  a finite-dimen- 
sional (modal  truncation)  model.  Explicit  optimality  conditions  are  derived 
for  minimum  data/maximum  entropy  stochastic  model  of  the  system  parameter 
uncertainties.  A major  feature  of  this  approach  seemed  to  be  the  ability  to 
directly  secure  robust  stability  and  mean-^square  performance  optimality. 

Mackay's  paper  solves  for  the  optimal  compensator  of  a distributed  param- 
eter system  in  terms  of  the  kernel  of  a Volterra  integral  operator.  A finite- 
dimensional  approximation  to  the  optimal  compensator  is  thus  obtained. 

The  paper  by  Chen  et  al.  addresses  the  very  important  and  little-studied 
problem  of  numerical  methods  for  solution  of  control  systems  governed  by 
hyperbolic  PDE.  The  discussion  that  followed  the  paper  recognized  the  impor- 
tance of  the  problem,  not  only  for  PDE,  but  also  for  finite-element  control 
laws  because  of  the  high  dimensionality.  There  was  substantial  support  ex- 
pressed for  much  more  research  in  numerical  techniques. 

The  two  main  results  of  Salas'  paper  are  stability  criteria  in  both  the 
time  and  frequency  domains  for  closed-loop  distributed  parameter  systems  with 
finite-dimensional  controllers.  The  stability  analysis  of  the  system  uses  the 
theory  of  semigroups  on  the  infinite-dimensional  state  space.  The  criteria  is 
not  constrained  by  the  controller  design  approach,  e.g.,  finite-dimensional 
reduced— order  models.  The  author  stresses  the  point  that  closed-loop  stability 
analysis  must  be  a part  of  any  attempt  to  produce  finite-dimensional  controllers 
for  Infinite-dimensional  systems. 

Skelton*  was  unable  to  attend  the  workshop  and  present  his  paper.  Peter 
Hughes  graciously  offered  to  summarize  the  main  points  of  Skelton's  "Analysis 
of  Structural  Perturbations  via  Cost  Decomposition  Methods,"  since  he  was  a 
close  associate  and  familiar  with  Skelton's  work.  The  paper  presents  a method 
of  analysis  of  linear  dynamical  systems  by  decomposition  of  some  functional  ex- 
pansion of  the  system  such  that  it  is  possible  to  measure  the  contribution  of 
each  component  to  the  cost  function.  It  would  be  effective,  for  example,  in 
model  order  reduction  of  a modal  representation  of  a large  space  structure  or  in 
constructing  a reduced-order  controller  from  an  optimal  regulator  design.  The 
discussion  of  this  method  generally  concluded  that  it  was  very  effective.  One 
cautionary  note  was  raised  with  respect  to  what  Gran  termed  "unstably  interacting 
modes.  The  cost  decomposition  method  might  indicate  that  such  a mode  could  be 
discarded,  because  of  insignificant  contribution  to  the  cost  with  nominal  system 
parameters,  but  a small  change  in  some  parameter  could  lead  to  the  control  system 
driving  that  mode  unstable. 

*Skelton's  paper  was  in  Session  VII,  not  V,  but  is  discussed  here  because  it  fits 
this  topic  better. 
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Session  VI:  Discussion  - LSS  Applications  of  Distributed  System  Theory 

The  panel  discussion  was  aimed  at  identifying  the  role  that  the  control 
theory  for  distributed  systems  can  play  in  the  control  of  large  space  structures. 
The  moderator  was  R.S.  Gran  of  Grumman  Aerospace  and  the  panel  members  were: 

D.C.  Washburn  of  the  Air  Force  Weapons  Labs  (AFWL) , V.  Komkov  of  West  Virginia 
University,  D.L.  Russell  of  the  University  of  Wisconsin  and  J.S.  Gibson  of 
the  University  of  California,  Los  Angeles  (UCLA)-.  The  following  are  what 
appeared  to  be  the  most  important  comments: 

• In  selecting  a model  to  represent  a large  space  structure  one  must 
consider  what  the  purpose  is  of  the  model.  One  may  want  a different  model 
for  design  and  analysis  than  for  real-time  control.  All  mathematical 
models  are  approximations.  A major  objective  of  research  is  to  attempt  to 
determine  what  are  the  best  models  for  various  purposes.  Considerable 
research  has  been  conducted  on  finite-element  models.  More  research  is 
needed  on  PDE  modeling  to  determine  where  and  for  what  purposes  they  are 
most  useful. 

• Truncating  a PDE  model  before  solving  the  control  problem  may  be 
throwing  away  Important  information  and  obscuring  some  of  the  fundamental 
properties  of  the  problem.  If  the  PDE  control  problem  is  solved  and  then 
approximated,  one  might  expect  something  different,  e.g.,  robustness  may 
be  achieved  more  easily. 

• If  a PDE  model  or  a compound  PDE/ODE  model  best  describes  the  physi- 
cal laws  of  the  structural  dynamics,  and  one  chooses  to  use  some  form  of 
a functional  expansion  or  finite-element  model,  then  one  may  lose  the 
physical  insight  into  the  control  problem.  Fundamental  phenomena  may  be 
lost  and  the  control  properties  may  be  very  different.  Once  a controller 
is  designed,  the  closed-loop  performance  should  be  analyzed  using  the 
best  physical  law  model  and  should  assure  that  assumptions  have  not  been 
violated. 

• There  is  a real  need  to  estimate  better  the  damping  in  large  space 
structures  and  include  realistic  damping  in  controller  designs. 

• Structural  dynamicists  regularly  use  PDE  models  to  do  parameter  studies 
of  relatively  simple  structures.  Finite-element  models  are  used  in  the 
final  design  of  complicated  systems  such  as,  for  example.  Voyager.  When  the 
hardware  is  built,  vibration  testing  is  conducted  to  verify  the  dynamics  and 
update  the  controller,  if  necessary.  Predicting  the  damping  is  particularly 
difficult  for  real,  complex  systems  because  of  joints  and  nonlinearities. 

The  argument  between  finite  element  and  continuum  models  may  be  academic  be- 
cause the  real  errors  come  about  due  to  the  structure  not  obeying  any  sort 
of  equation. 

• PDE  models  can  be  very  useful  in  "thought  experiments"  which  can  help 
clarify  the  mental  processes  that  one  would  apply  to  the  analysis  or 
control  of  a distributed  parameter  system.  They  help  in  the  thinking 
process  before  doing  massive  number  crunching  on  a computer. 
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Session  VII:  Estimation/Identif ication 


The  last  session  of  formal  papers  dealt  with  the  questions  of  systems 
identification,  parameter  estimation  and  model  error  estimation.  There  were 
five  papers  presented  on  these  subjects  and  all  but  one  addressed  continuum 
models . 

The  importance  of  this  subject  was  emphasized  several  times  during  the 
workshop.  A statement  of  systems  identification  in  large  space  structure 
control  by  Vander  Velde  is  included  in  Appendix  C.  A consensus  of  the  workshop 
participants  appeared  to  support  the  need  for  increased  research  to  assure  that 
techniques  are  adequately  developed  and  tested  before  a large  structure  is 
deployed  in  space . 

The  paper  by  Hendricks  et  al.  presents  a simple  study  of  two  methods 
(forced  and  free  vibrations)  and  a standard  least-squares  numerical  procedure 
to  extract  damping  and  stiffness  parameters  for  a lumped  parameter  model.  A 
more  interesting  study  would  have  been  to  include  unmodeled  effects  (spillover) 
and  measurement  noise. 

Banks'  paper  presents  a good  theoretical  discussion  of  spline-based  approx- 
imation schemes  for  parameter  estimation  algorithms  for  distributed  parameter 
systems . Several  numerical  examples  have  been  conducted  to  show  the  effective- 
ness of  the  schemes  although  these  were  not  reviewed  in  much  detail  at  the 
workshop.  The  technique  now  needs  to  be  tried  on  a more  realistic  model  of  a 
space  structure. 

One  of  the  most  impressive  papers  of  this  session,  and  possibly  the  entire 
workshop,  was  the  Juang  and  Sun  paper,  "Parameter  Estimation  in  Truss  Beams 
Using  Timoshenko  Beam  Model  with  Damping."  Simple  continuum  models  to  represent 
large  truss  beams  and  truss  platforms  are  formed,  then  discretized  for  use  in  a 
least  square  identification  problem.  This  results  in  a greatly  reduced  number 
of  parameters  that  have  to  be  Identified  over  using  a truncated  finite-element 
model  of  the  entire  structure.  A numerical  example  for  a truss  beam  shows 
excellent  comparison  between  the  actual  model  and  the  simple  continuum  model. 

The  final  paper  of  the  workshop  by  Rodriguez  formulates  a model 
error  estimation  technique  for  elliptic  systems  with  a specific  application  to 
a large  space  antenna  static  shape  determination.  The  simple  geometric  shape 
of  the  antenna  allowed  a simple  PDE  formulation  and  sequential  solutions  which 
are  combinations  of  "forward"  filtering  and  "backward"  smoothing.  The  paper 
shows  how  conceptually  simple  the  PDE  formulation  is . 


Session  VIII:  Discussion  - Future  Research  Opportunities 

The  wrap-up  panel  discussed  the  state  of  distributed  system  theory  and 
applications  to  the  control  of  large  space  structures  and  identified  potential 
research  opportunities  for  NASA  consideration.  It  also  provided  a forum  for  all 
participants  to  comment  on  the  workshop  topics  and  contribute  to  recommendations. 
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The  moderator  was  H.A.  Rediess  of  Mllco  and  the  panel  members  were: 

A.V.  Balakrishnan  of  UCLA,  A.E.  Bryson,  Jr.,  of  Stanford,  J.A.  Breakwell  of 
LMSC,  P.C.  Hughes  of  the  University  of  Toronto,  and  W.E.  Vander  Velde  of  MIT. 
Some  of  the  discussion  and  conclusions  reached  were  reiterations  of  comments 
made  during  the  first  two  panel  discussions.  At  the  conclusion  of  the  panel, 
Rediess  summarized  the  major  recommendations  to  NASA,  which  are  listed  in  the 
section  on  future  research  recommendations  and  are  not  repeated  here.  The 
following  are  what  appeared  to  be  the  most  important  comments: 

• On  the  central  question  of  the  workshop  on  the  use  of  continuum 
models  vs.  finite-element  models,  there  were  three  principal  questions — 

1.  Most  practical  large  space  structure  configurations  are 
too  complicated  to  model  as  a distributed  parameter 
system  and  it  is  best  to  use  a finite-element  one  . 

2.  Practical  large  space  structures  cannot  only  be  modeled  as 
distributed  parameter  systems  (or  in  combination  with  lumped 
parameter  elements)  but  also  the  PDE  control  theory  results 
in  a simpler  formulation,  potentially  better  control  laws 
and  better  physical  insight  into  the  problem. 

3.  Finite-element  models  have  proven  to  be  effective  in 
designing  spacecraft  whereas  continuum  models  have  not 
yet  been  proven  to  be  as  useful.  Research  should 
continue  on  the  continuum  model  approach  because  if 
there  is  to  be  a major  advancement  in  control  theory, 
it  most  likely  will  come  from  that  approach. 

• On  the  question  of  bandwidth  and  controller  complexity,  there  were 
two  distinctly  different  viewpoints  expressed — 

1.  There  is  only  a limited  bandwidth  that  needs  to  be 
considered  in  any  real  system;  hence,  a reduced  order 
model  is  sufficient.  Linear  Quadratic  Gaussian  (LQG) 
design  is  always  too  complex.  A reduced-order  controller 
or  a fixed,  low  order  compensator  is  a better  approach. 

2.  A truncated  model  with  limited  bandwidth  may  throw  away 
important  information  needed  for  robustness  considerations 
and  obscure  certain  fundamental  characteristics  of  the 
system.  It  is  preferable  to  design  a controller  based  on 
infinite-dimensional  LQG  to  assure  performance  and  robust- 
ness, and  then  to  discretize  for  implementation. 

• PDE  estimation  and  control  theory  is  at  the  state  of  development  when 
it  should  be  tested  on  major,  realistic  space-structure  problems  and  com- 
pared to  the  finite-element  approach.  The  comparison  should  include  ex- 
perimental evaluation,  if  possible. 
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• The.  problem  of  assuring  robustness  in  the  controller  design, 
whether  based  on  continuum  or  finite -element  models,  is  still  a major 
concern  and  needs  much  more  study.  Several  participants  believed  PDE 
is  the  best  approach  for  treating  robustness. 

" Simple  laboratory  experiments  are  now  affordable  and  can  be  very 
useful  in  understanding  the  control  problem  and  model  order  reduction. 

» Systems  identification  for  large  space  structures  in  orbit  is  going 
to  be  a very  difficult  task.  Because  of  modeling  uncertainties,  it  will 
be  necessary  to  identify  key  parameters  in  space  when  the  structure  is 
first  deployed  in  order  to  "tune  up"  the  control  laws  before  the  system 
goes  into  operation.  Another  kind  of  identification  will  be  needed  to 
monitor  the  system's  characteristics  throughout  the  mission  lifetime  to 
detect  changes  that  could  cause  control  system  instabilities  (see 
Appendix  C) . 

« Reliability  issues  for  large  space  structure  distributed  control  and 
stabilization  systems  have  not  been  adequately  addressed  in  considering 
control  law  design.  Such  systems  are  expected  to  have  hundreds  of 
components,  and  failures  are  likely  to  occur.  Controller  designs  that 
account  for  potential  failures  may  be  quite  different  from  those  that 
assume  no  failures  (see  Appendix  D) . 

• The.  space  environment  is,  for  the  most  part,  very  benign,  and  natural 
disturbances  tend  to  be  quasi-static  within  a period  of  one  orbit.  The 
Biajor  disturbances  that  excite  structural  modes  tend  to  be  man-made:  im- 
pulsive actuators,  slewing,  orbit  changes,  docking,  mass  changes  and 
astronaut  operations. 

• One  of  the  most  important  technology  developments  needed  for  large 
space  structure  control  is  an  accurate  method  to  predict  and  model 
damping.  Passive  damping  should  not  be  ignored  in  design  of  a large 
space  structure. 


General  Observations  of  the  Reviewer 

During  the  course  of  the  workshop,  and  in  preparing  this  evaluation 
report,  the  reviewer  developed  the  following  general  observations  about  the 
state  of  distributed  system  theory  and  applications  to  control  of  large  space 
structures : 


• Deficiencies  in  control  theory  are  probably  not  critical  in  the 
development  of  near-term  space  structures.  However,  advances  in  control 
theory  and  technology  have  the  potential  of  providing  design  methods  and 
control  laws  which  could  result  in  more  efficient  large  space  systems 
in  the  future  and  may  be  necessary  for  structures  exceeding  a few 
hundred  meters  in  dimension. 
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• One  should  not  argue  that  distributed  parameter  control  theory  is 
better  or  worse  than  lumped  parameter  control  theory  but  rather  think  of 
them  as  complementary  methodologies . They  are  both  tools  which  the 
designer /analyst  should  use  where  they  are  most  useful.  It  appears  that 
distributed  parameter  control  theory  may  be  very  useful  in  the  early 
conceptual  analysis  and  investigating  control  law  concepts.  It  appears 
that  finite -element  models  may  be  a most  effective  way  to  design  and 
analyze  the  extremely  complicated  final  space  system  to  be  deployed. 

Where  appropriate,  one  should  use  a compound  model  of  finite-elements  and 
distributed  parameter. 

In  view  of  the  fact  that  distributed  parameter  system  control  theory 
has  not  been  developed  to  anywhere  near  the  same  level  as  lumped  parameter 
system  control  technology,  it  seems  wise  to  accelerate  research  in  the 
applications  of  distributed  parameter  control  for  large  space  structures. 
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APPENDIX  A - WORKSHOP  AGENDA 


WEDNESDAY , 
8:00  a.m. 


10:00  - 
12 : 00  noon 


12:00  noon 

1:00  - 
4:00  p-m- 


JULY  14 

Registration 

Welcome  to  JPL  - General  C.H.  Terhune,  Jr. 

Acting  Director,  Jet  Propulsion  Laboratory 

Introduction  to  Workshop  - H.A.  Rediess,  Milco  International,  Inc. 

NASA  LSS  Missions  and  Control  Requirements  - A.F.  Tolivar,  Jet 
Propulsion  Laboratory 

Break 

SESSION  I:  MODELING  AND  CONTROL 

CHAIRMAN:  P.K.C.  Wang,  University  of  California,  Los  Angeles 

Modeling  and  Control  of  Distributed  Structures 

L.  Meirovltch,  Virginia  Polytechnic  Institute  and  State 
University 

Space  Structure  Vibration  Modes:  How  Many  Exist?  Which  Ones 

Are  Important? 

P.C.  Hughes,  University  of  Toronto 

Control  of  Flexible  Structures:  A Systematic  Overview  of  the 

Problem 

R. S.  Gran,  Grumman  Aerospace  Corporation 

Distributed  System  Modeling  of  a Large  Space  Antenna 

M.  Hamidi,  G.  Rodriguez,  and  D.B.  Schaechter,  Jet  Propulsion 
Laboratory 

Lunch 

SESSION  II:  CONTROL  AND  STABILIZATION 

CHAIRMAN:  R.R.  Strunce,  Charles  Stark  Draper  Laboratories,  Inc. 

Modeling  of  Flexible  Structures  for  Active  Control 
A.E.  Bryson,  Jr. , Stanford  University 

Vibration  Suppression  in  Large  Space  Structures 

T.K.  Caughey  and  C.J.  Goh,  California  Institute  of  Technology 

Control  of  Antenna-Feed  Attitude  and  Reflector  Vibrations  in  Large 
Spaceborne  Antennas  by  Mechanical  Decoupling  and  Movable  Dampers 
P.K.C.  Wang,  Jet  Propulsion  Laboratory;  and  E.C.  Hong  and  J.C. 
Sarina,  University  of  California,  Los  Angeles 
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4:45  - 

6:00  p.m. 


THURSDAY, 

8:00  a.m. 
11:20  a.m 


A Closed-Loop  Principal  Component  Analysis  of  a Tetrahedral  Truss 
E.A.  Jonckheere,  University  of  Southern  California 

Control  of  Large  Space  Antennas  Based  on  Electromagnetic  - 
Structural  Models 

M.  Hamidi  and  F.  Manshadi,  Jet  Propulsion  Laboratory 
Break 

Active  Control  of  Space  Structures  (ACOSS)  Model  2 
T.C.  Henderson,  Charles  Stark  Draper  Laboratories,  Inc. 

Travelling  Wave  Effects  in  Large  Space  Structures 
A.  von  Flotow,  Stanford  University 

Distributed  Control  of  Large  Space  Antennas 

J.  Cameron,  M.  Hamidi,  Y.H.  Lin,  and  S.J.  Wang,  Jet  Propulsion 
Laboratory 

Break 

SESSION  III:  DISCUSSION  - LSS  CONTROL  PROBLEMS 

Participants:  R. R.  Strunce,  Charles  Stark  Draper  Laboratories, 

Inc.,  Moderator;  J.  Sesak,  General  Dynamics;  N.K.  Gupta,  Inte- 
grated Systems,  Inc.;  and  A.F.  Tolivar,  Jet  Propulsion 
Laboratory 

JULY  15,  1982 

SESSION  IV:  DISTRIBUTED  CONTROL 

CHAIRMAN:  A.F.  Tolivar,  Jet  Propulsion  Laboratory 

Number  and  Placement  of  Control  System  Sensors  Considering 
Possible  Failures 

W.E.  Vender  Velde,  and  C.R.  Carignan,  Massachusetts  Institute 
of  Technology 

The  TOYSAT  Structural  Control  Experiment 

J.A.  Breakwell  and  G.J.  Chambers,  Lockheed  Missiles  and  Space 
Company 

Large  Space  Structure  Model  Reduction  and  Control  System  Design 
Based  upon  Actuator  and  Sensor  Influence  Functions 
Y.  Yam,  J.H.  Lang,  T.L.  Johnson,  S.  Shih,  and  D.H.  Staelin, 
Massachusetts  Institute  of  Technology 

A Design  Procedure  for  Active  Control  of  Beam  Vibrations 
S.L.  Dickerson  and  G.  Jarocki,  Georgia  Institute  of  Technology 

On  Vibration  Control  of  Tethered  Satellite  Systems 
D.M.  Xu  and  A.K.  Misra,  McGill  University;  and  V.J.  Modi, 
University  of  British  Columbia 
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Break 


Orbiting  Chains  and  Rings 

J.V.  Breakwell,  Stanford  University 

Hardware  Verification  of  Distributed/Adaptive  Control 
D.  Eldred  and  D.B.  Schaechter,  Jet  Propulsion  Laboratory 

Lunch 

12:30  p.m.  - SESSION  V:  CONTROL  THEORY  FOR  DISTRIBUTED  SYSTEMS 

3:20  p.m.  CHAIRMAN:  Lt,  Col.  D.C.  Washburn,  Kirtland  Air  Force  Base 

Some  Remarks  on  the  Current  Status  of  the  Control  Theory  of 
Single  Space  Dimension  Hyperbolic  Systems 
D.L.  Russell,  University  of  Wisconsin 

Approximation  in  Control  of  Flexible  Structures,  Theory  and 
Applications 

J.S.  Gibson,  University  of  California,  Los  Angeles 

Simultaneous  Control  and  Optimization  for  Elastic  Systems 
V.  Komkov,  West  Virginia  University 

Robust  Control  of  Higher  Order  Systems  Using  Positivity 
R.J.  Benhabib  and  R.P.  Iwens,  TRW  Space  Technology  Group 

Fixed-Order  Dynamic  Compensation  through  Optimal  Projection 
D.C.  Hyland  and  A.N.  Madiwale,  Massachusetts  Institute  of 
Technology  Lincoln  Laboratory 

Approximation  of  the  Optimal  Compensator  for  a Large  Space 
Structure 

M.K.  Mackay,  University  of  California,  Los  Angeles 

Finite  Difference  Numerical  Methods  for  Boundary  Control  Problems 
Governed  by  Hyperbolic  Partial  Differential  Equations 
G.  Chen,  Q.  Zheng,  M.  Coleman  and  S.  Weerakoon,  Pennsylvania 
State  University 

*Stable  Feedback  Control  of  Distributed  Parameter  Systems:  Time 

and  Frequency  Domain  Conditions 
M.J.  Balas,  Rensselear  Polytechnic  Institute 

*Analysls  of  Structural  Perturbations  in  Systems  via  Cost 
Decomposition  Methods 
R.E.  Skelton,  Purdue  University 


Break 


*These  papers  were  presented  in  Session  VII  but  the  subjects  are  more  suitable 
for  Session  V. 
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6:30  p.m.  - 
9:30  p.m. 

FRIDAY,  JULY 

8:00  a.m.  - 
11:30  a.m. 


1:00  p.m.  - 
5:00  p.m. 


SESSION  VI:  DISCUSSION  - LSS  APPLICATIONS  OF  DISTRIBUTED  SYSTEM 

THEORY 

Participants:  R.S.  Gran,  Grumman  Aerospace  Corporation, 

Moderator;  D.C.  Washburn,  Air  Force  Weapons  Laboratory;  V. 

Komkov,  West  Virginia  University;  D.L.  Russell,  University  of 
Wisconsin;  and  J.S.  Gibson,  University  of  California,  Los 
Angeles 

SOCIAL  HOUR  AND  BANQUET  (Pasadena  Holiday  Inn) 

Speaker:  H.  Ashley,  Stanford  University 

16,  1982 

SESSION  VII:  ESTIMATION/ IDENTIFICATION 

CHAIRMAN:  L.W.  Taylor,  Jr.,  Langley  Research  Center 

Algorithms  for  Estimation  in  Distributed  Models  with  Applications 
to  Large  Space  Structures 
H.T.  Banks,  Brown  University 

Identification  of  Large  Flexible  Structures  Mass/Stiffness  and 
Damping  from  On-Orbit  Experiments 

S.L.  Hendricks,  S.  Rajaram,  M.P.  Kamat  and  J.L.  Junkins,  Virginia 
Polytechnic  Institute  and  State  University 

Break 

Joint  State  and  Parameter  Estimation 

N.  Carmichael,  University  of  Warwick,  and  M.D.  Quinn,  Sheffield 
City  Polytechnic,  United  Kingdom 

Parameter  Estimation  in  Truss  Beams  Using  Timoshenko  Beam  Model 
with  Damping 

C.T.  Sun,  Purdue  University;  and  J.N.  Juang,  Martin  Marietta 
Aerospace 

A Function  Space  Approach  to  State  and  Model  Error  Estimation 
for  Elliptic  Systems 

G.  Rodriguez,  Jet  Propulsion  Laboratory 
Lunch 


SESSION  VIII:  DISCUSSION  - FUTURE  RESEARCH  OPPORTUNITIES 

Participants:  H-A.  Rediess,  Milco  International,  Inc., 

Moderator;  A.V.  Balakrishnan,  University  of  California,  Los 
Angeles;  A.E.  Bryson,  Jr.,  Stanford  University;  J.A.  Breakwell, 
Lockheed  Missiles  and  Space  Company;  P.C.  Hughes,  University 
of  Toronto;  and  W.E.  Vander  Velde,  Massachusetts  Institute  of 
Technology 
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APPENDIX  C - STATEMENT  ON  SYSTEM  IDENTIFICATION  IN 
LARGE  SPACE  STRUCTURE  (LSS)  CONTROL 

W.E.  VANDER  VELDE,  MIT 

One  of  the  most  unsettling  aspects  of  the  prospect  of  controlling  flexible 
structures  in  space  is  the  fact  that  many  configurations  will  not  permit 
experimental  tests  on  the  ground  to  verify  or  improve  the  models  of  their 
dynamic  properties.  In  some  mission  situations,  then,  depending  on  the  band- 
width of  the  controlled  system  necessary  to  meet  the  performance  specifications, 
it  is  virtually  certain  that  the  modeling  errors  upon  initial  deployment  of  the 
assembly  in  space  will  be  great  enough  to  preclude  the  successful  operation  of 
the  required  control  system.  In  that  case,  there  will  have  to  be  an  Initial 
period  of  operation  under  the  control  of  a more  tolerant  system  during  which 
the  dynamics  of  the  structure  are  identified. 

It  is  suggested  that  dynamic  model  Identification  in  the  large  space 
structure  (LSS)  context  be  visualized  as  having  two  modes. 

1.  Complete  system  identification  during  the  initial  phase  of  operation. 

2.  Dynamic  model  monitoring  during  the  remainder  of  the  mission  period 

with  selected  parameters  updated  as  necessary. 

The  first  mode  of  initial  system  identification  is  a massive  undertaking  whose 
purpose  is  to  estimate  the  large  number  of  parameters  necessary  to  define  the 
dynamic  model  required  for  the  control  function.  Because  only  elementary 
control  can  be  exercised  during  this  period , and  special  test  inputs  may  be 
necessary  to  enhance  the  Identlf lability  of  system  parameters,  the  system  will 
not  be  able  to  perform  its  primary  mission (s)  during  this  time.  Thus  a large 
part  of  the  system's  data  processing  capability  can  be  devoted  to  the  identifi- 
cation function  in  this  initial  phase.  Later,  when  the  system  is  in  full 
operation,  the  data  processors  will  be  devoted  largely  to  mission  requirements 
such  as  sensor  signal  processing  and  thus  the  need  for  the  second  mode  of 
dynamics  monitoring  which  must  be  designed  to  place  only  a small  demand  on  the 
data  processor. 

A considerable  amount  of  research  has  been  done  on  the  mode  1 type  of 
problem — complete  system  identification — but  the' LSS  identification  task  is 
far  more  demanding  in  that  such  a large  number  of  parameters  will  have  to  be 
identified.  It  is  not  clear  that  this  task  is  computationally  feasible;  specific 
research  should  be  directed  at  the  LSS  identification  problem  with  the  objective 
of  formulating  an  approach  with  the  best  possible  computational  efficiency. 

The  mode  2 task  of  monitoring  the  dynamics  and  improving  the  estimates  of 
selected  parameters  as  necessary  is  a new  concept  which  deserves  research 
support. 
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APPENDIX  D - STATEMENT  ON  RELIABILITY  ISSUES  IN 
LARGE  SPACE  STRUCTURE  (LSS)  CONTROL 

W.E.  VANDER  VELDE,  MIT 

One  aspect  of  active  control  of  large  space  structures  that  is  receiving 
too  little  attention  is  the  importance  of  the  unreliability  of  control  system 
components . This  is  not  to  say  that  unreliable  components , according  to  our 
best  standards,  will  be  utilized.  Rather,  it  is  the  sheer  number  of  these 
components,  notably  sensors  and  actuators,  which  will  likely  be  required  to 
effect  good  control  over  the  structure  which  assures  that  we  will  have  to  deal 
with  failures.  As  one  example,  if  the  system  has  a total  of  400  sensors  and 
actuators,  each  with  a mean  time  to  failure  of  100,000  hours  (optimistic  by 
today's  standards),,  one  may  expect  to  experience  a component  failure  about 
every  10  days  on  average.  Obviously,  the  system  must  be  designed  to  accommodate 
these  failures  and  continue  to  function. 

There  are  several  aspects  to  the  overall  problem  of  designing  a control 
system  for  fault  tolerance.  The  most  Important  of  these  are: 

1.  Choose  the  number  of  sensors  and  actuators  to  be  incorporated  in 
the  system,  and  decide  to  place  them  on  the  structure,  with  recognition 
of  the  fact  that  many  of  them  are  likely  to  fail  during  the  mission 
period . 

2.  Incorporate  a fault-monitoring  function  in  the  control  system  to 
detect  the  events  of  a component  failure  and  isolate  the  faulty 
component . 

3.  Utilize  a fault-tolerant  signal  transmission  and  data  processing 
system  to  implement  the  controller  and  fault-monitoring  functions . 

4.  Implement  a method  of  reconfiguring  the  control  system,  upon  isolation 
of  a component  failure,  so  as  to  function  without  the  faulty  component. 

5.  Devise  means  for  evaluating  the  likely  performance  of  a proposed 
system  configuration  including  its  failure  accommodating  functions. 

Some  research  has  been  performed  on  all  of  these  subjects,  but  most  of 
it  was  in  the  context  of  other  applications  than  LSS  control.  Some  current 
research  is  being  funded  on  fault-tolerant  computers,  item  3 above,  but  very 
little  support  is  being  given  to  work  on  the  other  items.  Failure  detection 
and  isolation  (FDI) , item  2 above,  has  been  developed  largely  in  the  context  of 
aircraft  flight  control  systems.  But  the  LSS  application  is  different.  The 
same  characteristics  that  make  control  of  LSS  difficult,  such  as  high  dimension- 
ality and  inadequate  modeling,  also  make  FDI  difficult.  But  given  that 
component  failures  are  virtually  certain  to  occur,  these  aspects  of  dealing 
with  component  unreliability  should  be  given  high  priority  in  NASA  planning  for 
research . 
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